NUCLEIC SEQUENCES ENCODING AN AT2 RECEPTOR-INTERACTING
PROTEIN (ATIP) AND THEIR APPLICATIONS

The present invention relates to nucleic
sequences encoding a protein capable of interacting
with the AT2 receptor, to oligonucleotides contained in
the said sequences, to their applications as probes and
for the expression of the said proteins, to the vectors
useful for the said expression, to the host cells
containing the said vectors and to a model for studying
the AT2 receptor.

The present invention also relates to the said 
proteins and to their applications. 

The octapeptide angiotensin II, mainly known
as a regulator of blood pressure, has also been
described as an important modulator of cell growth.
Interestingly, this peptide appears to exert opposite
effects on cell growth according to whether it is bound
to one or the other of its two subtypes of membrane
receptors (AT1 or AT2).

The AT2 receptor subtype, which also belongs to
the G protein-coupled receptor family, is still poorly
characterized both from the point of view of its
mechanisms of activation and its physiological role
(C. Nahmias et al., Trends Pharmacol. Sci., 1995, 16,
223-225). Several arguments suggest, however, a role for
this receptor in the phenomena of cell proliferation,
differentiation or adhesion.

The AT2 receptor is highly expressed during
foetal life, disappears in adults in most tissues, but
is re-expressed under pathophysiological conditions
involving restructuring of the tissues.

Studies carried out in vivo have demonstrated
the inhibitory role exerted by the AT2 subtype on the
proliferation of the muscle cells of the tunica intima
vasorum after vascular lesion (P. Janiak et al.,
Hypertension, 1992, 20, 737-745; M. Nakajima et al.,
Proc. Natl. Acad. Sci. USA, 1995, 92, 10663-10667).

Moreover, the stimulation of the AT2 receptor
activates phosphatase SHP-1 (K. Bedecs et al., Biochem.
J., 1997, 325, 449-454). The fact that the AT2 receptor
activates a phosphatase is consistent with its
antiproliferative effects.

In the light of the above, it has been shown
that, on cells in culture, the AT2 receptor:

- inhibits the synthesis of DNA and
proliferation, which are induced by angiotensin II
(Ang II) and bFGF (M. Stoll et al., J. Clin. Invest.,
1995, 95, 651-657),

- induces apoptosis (T. Yamada et al., Proc.
Natl. Acad. Sci. USA, 1996, 93, 156-160), and

- induces neuronal differentiation (L. Laflamme
et al., J. Biol. Chem., 1996, 271, 22729-22735).

Studies of the signalling pathways associated
with the AT2 receptor have been undertaken in cells of
the N1E-115 line, which are derived from a murine
neuroblastoma and which express only the AT2 subtype. A
first study has made it possible to demonstrate rapid
and transient dephosphorylation of some proteins on
tyrosine residues following the treatment of N1E-115
cells with angiotensin II (C. Nahmias et al., Biochem.
J., 1995, 306, 87-92). It has also been shown that the
AT2 receptor interferes with the pathways for
activation of growth factor receptors and inhibits the
activity of the MAP (mitogen-activated protein) kinases
ERK1 and ERK2, which play a key role in the
phenomena of cell proliferation and differentiation.
The inhibitory effect of AT2 on the activation of MAP
kinases is rapid and transient, does not involve a
regulatory protein sensitive to pertussis toxin (of
the Gi/Go type), but involves the activation of an
orthovanadate-sensitive tyrosine phosphatase.

Taking into account the role of the AT2
receptor in cell proliferation, the inventors have
sought to develop tools capable of regulating the
action of the AT2 receptor. Indeed, the activation of
the AT2 receptor may have implications in oncology
(inhibition of cell proliferation).

In general, the AT2 receptor has opposite
effects to those of AT1 on the activation of MAP
kinases and on cell proliferation; study of the
communication which may exist between these two
receptor subtypes, which bind the same ligand, is
consequently of interest.

The study of the signalling pathways and of the
regulation of the AT2 receptor is also of major
importance for human health, given that antagonists of
the AT1 receptor are currently administered to patients
with hypertension. In this context, it is essential to
know the biological effects associated with the AT2
receptor, which can still be activated by circulating
Ang II during this type of treatment.

The subject of the present invention is an
isolated nucleic acid (DNA or RNA) fragment, encoding a
protein capable of binding to the AT2 receptor, which
fragment is selected from the group consisting of the
sequences SEQ ID NO:1, 3, 5, 7 and 9, as represented in
the sequence listing included in the present
application.

These various sequences correspond to the
complementary DNA (cDNA) encoding all or part of the
protein called hereinafter ATIP (AT2 interacting
protein).

The sequence SEQ ID NO:1 (1803 bp) corresponds
to the complete nucleic sequence of mouse ATIP and
includes both the parts encoding the AT2 receptor
binding protein and the noncoding parts.

The sequence SEQ ID NO:3 (1323 bp) corresponds to the
nucleic acid sequence of the coding part of the
sequence SEQ ID NO:1, while the sequence SEQ ID NO:5
corresponds to the fragment of SEQ ID NO:1 obtained by
the two-hybrid technique (A. Plessis et al., M/S, 1994,
9, I-IX; J. Luban et al., Curr. Op. Biotechnol., 1995,
6, 59-64).

The sequence SEQ ID NO:7 (3742 bp) corresponds
to the complete nucleic sequence of the human cDNA and
includes both the parts encoding the protein homologous
to the mouse ATIP and the noncoding parts.

The sequence SEQ ID NO:9 (1308 bp) corresponds
to the coding part of the sequence SEQ ID NO:7.

The subject of the present invention is also
transcripts, characterized in that they are
complementary to the sequences in accordance with the
invention and are in particular generated from the said
sequences.

The subject of the present invention is, in
addition, fragments of the said sequences comprising
between 20 and 400 bp, useful as probes or as primers,
for the detection of the sequences SEQ ID NO:1, 3, 5, 7
or 9, or of homologous sequences.

Among the said fragments, there may be
mentioned in particular a probe of 354 bp (SEQ ID NO:5)
as well as any fragment of 20 bp to 400 bp included in
the sequences SEQ ID NO:1, 3, 5, 7 or 9.

As primer, there will be used in particular the
sequence SEQ ID NO:10 (antisense oligonucleotide), which
makes it possible in particular to amplify the 5' parts
of the various mRNAs corresponding to ATIP (5' RACE
technique: Marathon cDNA amplification kit, Clontech).

It is also possible to use, as amplification
primers, any pair of oligonucleotides of more than 20
bp and comprising part of the ATIP (human or mouse)
nucleic sequence, in particular the pair SEQ ID NO:11-
SEQ ID NO:12.

The preferred hybridization (prehybridization
and hybridization) conditions are in particular the
following: 45% formamide, 9% dextran sulphate, 0.2%
BSA, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.1%
sodium pyrophosphate, 0.01% SDS, 0.05 mM Tris pH 7.5,
0.9 M NaCl and rinses to a stringency corresponding to
the buffer: 1 x SSC, 0.1% SDS.

The subject of the present invention is also a
purified and isolated protein, called ATIP, which is
capable of interacting with the AT2 receptor and which
is selected from the group consisting of the sequences
SEQ ID NO:2, 4, 6 or 8.

The murine and human sequences exhibit 85.6%
homology. The human sequence (human ATIP) possesses five
fewer amino acids than the mouse sequence (mouse ATIP).
The amino acids missing from the human sequence
correspond to amino acids 162, 163, 164,
166 and 214 of the mouse ATIP sequence.
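
The lengths quoted for the two coding sequences can be cross-checked against this five-residue difference with simple codon arithmetic. The sketch below assumes, as is conventional but not stated explicitly in the text, that each coding sequence length includes one terminal stop codon; the function name is illustrative only.

```python
def residues_from_cds(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a coding sequence ending in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the final codon is assumed to be the stop codon

mouse_residues = residues_from_cds(1323)  # coding part of the mouse cDNA (SEQ ID NO:3)
human_residues = residues_from_cds(1308)  # coding part of the human cDNA (SEQ ID NO:9)

print(mouse_residues, human_residues, mouse_residues - human_residues)  # 440 435 5
```

Under that assumption the mouse protein has 440 residues and the human protein 435, which agrees with the difference of five amino acids stated above.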

Comparisons (Blast) between the ATIP protein
sequences according to the invention and the sequences
contained in data banks indicate that human ATIP (like
mouse ATIP) never exhibits more than 25% homology with
a known sequence, and this being the case only over
part of this sequence.

The subject of the present invention is also a
translational product, characterized in that it is
encoded by a nucleotide sequence in accordance with the
invention.

The subject of the present invention is, in
addition, antibodies, characterized in that they are
directed against the ATIP protein or an ATIP protein
fragment according to the invention.

The subject of the present invention is also a
recombinant cloning and/or expression vector,
characterized in that it comprises a nucleotide
sequence in accordance with the invention.

The subject of the present invention is also a
transformed host cell, characterized in that it
comprises a vector as defined above.

Among the preferred transformed cells according
to the invention, there may be mentioned E. coli and
CHO cells.

The subject of the present invention is also
transformed host cells, characterized in that they
consist of a suitable yeast strain cotransformed with
at least two vectors which respectively encode (i) a
so-called bait protein selected from the group
consisting of a fragment containing at least SEQ ID
NO:5 of the ATIP protein and a fragment containing at
least the C-terminal end of the AT2 receptor, which
bait protein is fused with a protein selected from the
group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
same transcription factor and (ii) a so-called prey
protein, selected from the group consisting of a
fragment containing at least SEQ ID NO:5 of the ATIP
protein, a fragment containing at least the C-terminal
end of the AT2 receptor and any other polypeptide
corresponding to a sequence contained in a cDNA
library, which prey protein is fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the same transcription factor, which vectors
comprise, in addition, selectable markers.

According to an advantageous embodiment of the
said cells, they consist in particular of:

- either a suitable yeast strain cotransformed
with three vectors which respectively encode (i) a bait
corresponding to a fragment containing the C-terminal
end of the AT2 receptor fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor, (ii) a fragment containing
at least SEQ ID NO:5 of the ATIP protein according to
the invention, fused with a protein selected from the
group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (iii) a polypeptide
corresponding to a sequence contained in a cDNA
library, which vectors comprise, in addition,
selectable markers,

- or a suitable yeast strain cotransformed with
two vectors which respectively encode (i) a fragment
containing at least SEQ ID NO:5 of the ATIP protein
according to the invention, fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the said transcription factor and (ii) a
polypeptide corresponding to a sequence contained in a
cDNA library, fused with a protein selected from the
group consisting of the DNA-binding domain of the
transcription factor and the activation domain of the
said transcription factor, which vectors comprise, in
addition, selectable markers,

- or a suitable yeast strain cotransformed with
two vectors, namely (i) a vector encoding a fragment
containing at least SEQ ID NO:5 of the ATIP protein,
mutated or not, fused with a protein selected from the
group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (ii) a vector encoding a
fragment containing the C-terminal end of the AT2
receptor, mutated or not, fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor, which vectors comprise, in
addition, selectable markers, one of the two vectors
necessarily encoding a mutated protein.

The subject of the present invention is also a
method for selecting proteins inhibiting the interaction
between the ATIP protein according to the invention and
the AT2 receptor, which method comprises:

(a) cotransforming a suitable yeast strain with
three vectors which respectively encode (i) a bait
corresponding to a fragment containing the C-terminal
end of the AT2 receptor fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor, (ii) a fragment containing
at least SEQ ID NO:5 of the ATIP protein according to
the invention, fused with a protein selected from the
group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (iii) a polypeptide
corresponding to a sequence contained in a cDNA
library, which vectors comprise, in addition,
selectable markers,

(b) selecting, on an appropriate selective medium,
the clones of the cDNA library expressing a polypeptide
inhibiting the interaction between the AT2 receptor and
the ATIP protein according to the invention, and

(c) identifying the said polypeptide.

Such a method uses in particular the so-called
reverse two-hybrid or three-hybrid technique as
described in Vidal et al. (Proc. Natl. Acad. Sci. USA,
1996, 93, 10315-10320 and 10321-10326) or Tirode et al.
(J. Biol. Chem., 1997, 272, 37, 22995-22999).

The subject of the present invention is also a
method for screening polypeptides interacting with the
ATIP protein according to the invention, which method
comprises:

(a) cotransforming a suitable yeast strain with
two vectors as defined above, namely which respectively
encode (i) a fragment containing at least SEQ ID NO:5
of the ATIP protein according to the invention, fused
with a protein selected from the group consisting of
the DNA-binding domain of a transcription factor and
the activation domain of the said transcription factor
and (ii) a polypeptide corresponding to a sequence
contained in a cDNA library, fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the said transcription factor, which vectors
comprise, in addition, selectable markers, and

(b) selecting the clones expressing a
polypeptide interacting with the ATIP protein, on a
suitable selective medium.

Such a method makes it possible in particular
to search for other proteins interacting with the ATIP
protein, in particular in order to find the next links
in the pathway activated by the AT2 receptor, so as to
use them to modify the interaction between the protein
according to the invention and the AT2 receptor.

The subject of the present invention is also a
method for characterizing the domains involved in the
ATIP protein-AT2 receptor interaction, characterized in
that it comprises:

(a) cotransforming a suitable yeast strain with
two vectors, as defined above, namely (i) a vector
encoding a fragment containing at least SEQ ID NO:5 of
the ATIP protein, mutated or not, fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the said transcription factor and (ii) a
vector encoding a fragment containing the C-terminal
end of the AT2 receptor, mutated or not, fused with a
protein selected from the group consisting of the
DNA-binding domain of a transcription factor and the
activation domain of the said transcription factor,
which vectors comprise, in addition, selectable
markers, one of the two vectors necessarily encoding a
mutated protein, and

(b) visualizing, by selection on a suitable
selective medium, the possible loss of the ATIP-AT2
receptor interaction.

Such a method makes it possible to identify and
to delimit the important domains of the ATIP protein or
of the C-terminal end of the AT2 receptor, on which
their interaction depends, so as to use them as
preferred targets for modifying AT2 receptor
signalling.

The subject of the present invention is also a
method for selecting substances capable of influencing
the interaction between the ATIP protein according to
the invention and the AT2 receptor, which method
comprises:

(a) bringing the ATIP protein, attached to a
support, into contact with an AT2 receptor-protein tag
fusion protein, optionally in the presence of a
substance to be tested,

(b) washing the said support thus treated at
least once with a suitable buffer, and

(c) visualizing the possible ATIP-AT2 receptor
interaction, in particular by SDS-PAGE followed by
immunoblotting with antibodies directed against the
protein tag fused with the AT2 receptor, or against
the AT2 receptor.

If the substance to be tested inhibits the
ATIP-AT2 receptor interaction, the visualization step
is negative.

In accordance with the invention, ATIP is
attached to the said support either covalently, or
through affinity binding between an attachment
protein fused with ATIP and the said support. For
example, the said support consists of beads coupled
either to a substance having affinity for the said
attachment protein fused with ATIP, or to suitable
antibodies.

The AT2 receptor-protein tag fusion protein is
in particular obtained from a lysate of cells
transfected with a vector expressing the AT2
receptor-protein tag fusion protein.

As a variant, the said method for selecting
substances capable of interacting with the ATIP protein
according to the invention comprises:

(a) bringing the ATIP protein, attached to a
support, into contact with a cell lysate,

(b) washing the said support thus treated at
least once with a suitable buffer,

(c) visualizing any protein associated
with the ATIP protein, in particular by SDS-PAGE
followed by immunoblotting with appropriate antibodies,
and

(d) identifying the protein in the cell lysate
interacting with the ATIP protein.



In accordance with the said method for
selecting substances capable of influencing the
interaction between the ATIP protein according to the
invention and the AT2 receptor, it is possible to use
in particular, as ATIP-protein tag fusion proteins, the
proteins GST-ATIPc and MYC-ATIPc, which constitute tools
that can make it possible to pull down in vitro any
proteins interacting with ATIP, for example from lysates
of cells activated or otherwise with ligands for the AT2
receptor. The GST-ATIP protein may be pulled down by
specific interaction of GST with agarose beads coupled
to glutathione, or alternatively immunoprecipitated
with the anti-ATIP antibody. The MYC-ATIP protein may
be immunoprecipitated with commercial anti-MYC
antibodies or with the anti-ATIP antibody.

The advantage of these methods consists in 
finding means of modifying the signalling, the level of 
expression or the pharmacology of the AT2 receptor, 
which may have therapeutic applications. Indeed, when a 
pathological condition has been clearly correlated with 
a transduction abnormality associated with the AT2 
receptor, modification of this transduction, in 
particular by acting on the binding of the AT2 receptor 
to the protein according to the invention, may then 
possibly compensate for the pathological disorder or at 
least influence it. 

The subject of the present invention is also
the use of the abovementioned cotransformed cells for
the selection and screening of substances or of
proteins capable of influencing the ATIP protein-AT2
receptor interaction or capable of interacting with the
ATIP protein.

In addition to the preceding features, the 
invention also comprises other features which will 
emerge from the description which follows, which refers 
to exemplary embodiments of the method which is the 
subject of the present invention as well as to the 
accompanying drawings, in which:

- Figure 1 corresponds to the C-terminal end of
the mouse AT2 receptor, used as a two-hybrid bait for
screening a mouse cDNA library;

- Figure 2 illustrates the position of the
GAL4-binding domain and the multiple cloning site of
the plasmid pGBT9 (Clontech);

- Figure 3 illustrates the presumed coiled-coil
structures (coiled-coil domains underlined) of mouse
ATIP;

- Figure 4 illustrates the presumed coiled-coil
structures (coiled-coil domains underlined) of human
ATIP;

- Figure 5 illustrates the structure of the
plasmid pVP16;

- Figure 6 illustrates the multiple cloning
site of the plasmid pRSET A;

- Figure 7 illustrates the MYC sequence used to
construct the plasmid pcDNA3-MYC;

- Figure 8 illustrates the structure of the
plasmid pBAC-PAK-poly HIS;

- Figure 9 illustrates a Northern blot of
several human tissues hybridized with the probe
ATIPmouse-short (SEQ ID NO:5);

- Figure 10 illustrates the interaction in
vitro of the protein ATIPmouse-short with the
C-terminal end of the AT2 receptor; and

- Figure 11 illustrates the modifications of
the signal induced by the AT2 receptor by
overexpression of the ATIP protein.

It should be clearly understood, however, that
these examples are given solely by way of illustration
of the subject of the invention and do not constitute
in any manner a limitation thereto.

EXAMPLE 1: Demonstration of a specific protein-protein
interaction between the AT2 receptor and the protein
having the sequence SEQ ID NO:6 according to the
invention

Materials and methods

- The two-hybrid system, initially developed by
Fields and Song in 1989 (Nature, 340, 245-246), is based
on the fact that the activity of numerous eukaryotic
transcription-activating factors requires only two
domains: an activating domain which does not have the
capacity to bind DNA and a DNA-binding domain.

In the two-hybrid system, the DNA-binding
domain is fused with a protein X and the activation
domain is fused with a protein Y. If, and only if, X
and Y interact, a complex is formed which reconstitutes
a functional transcription factor.

- Construction of the expression vectors:

. "Bait" vectors:

Protein X: C-terminal end of the sequence
encoding the mouse AT2 receptor (52 amino acids, from
CVNPF to the stop codon, see Figure 1), fused with the
sequence encoding the Gal4 DNA-binding domain (Figure
2).

Insert: end of the mouse AT2 receptor (159 bp +
16 bp of sites generated by PCR) inserted at the level
of the EcoRI and BamHI sites of the vectors pLEX9
(Clontech) or pGBT9 (modified pGAD424 or pBTM116; A.B.
Vojtek et al., Cell, 1993, 74, 205-214).

The following sequence is thus obtained:
5' side CGGAATTC - AT2 C-terminal sequence of 52
amino acids - GGATCCCG 3' side

. Screened library:

Mouse foetal cDNA library (A.B. Vojtek et al.,
Cell, 1993, 74, 205-214), containing inserts of 350 to
700 bp (protein Y) in the vector VP16 (Figure 5).

. "Bait" control vectors:

Protein X: C-terminal end of the human
β2-adrenergic, rat AT1 or human bradykinin receptors.

. Transformed yeast strains:

HF7c (Clontech) for the bait constructed in
pGBT9;

L40 for the bait constructed in pLEX9.
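
The size quoted above for the bait insert (159 bp plus 16 bp of PCR-generated sites) can be checked with a few lines of arithmetic. The sketch assumes that the 159 bp comprise the 52 codons of the C-terminal fragment plus one stop codon, and that the 16 bp are the two 8-nucleotide flanking sequences given above; the variable names are illustrative.

```python
FLANK_5 = "CGGAATTC"  # 5' flank given in the text; carries the EcoRI site GAATTC
FLANK_3 = "GGATCCCG"  # 3' flank given in the text; carries the BamHI site GGATCC

coding_bp = 52 * 3 + 3                   # 52 codons plus an assumed stop codon
flank_bp = len(FLANK_5) + len(FLANK_3)   # nucleotides added by the PCR primers

# Each flank should contain the recognition site used for cloning.
assert "GAATTC" in FLANK_5 and "GGATCC" in FLANK_3

print(coding_bp, flank_bp, coding_bp + flank_bp)  # 159 16 175
```

Under these assumptions the amplified fragment is 175 bp long before digestion.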
Results

This strategy made it possible to isolate a
clone derived from the cDNA library containing an
insert of 354 bp (ATIP) which interacts specifically
with the C-terminal end of AT2. It is of interest to
note that the screening of this library with the
constructs produced in the two expression vectors pGBT9
and pLEX9 made it possible to find this same clone in
both cases. This clone does not interact with control
proteins exhibiting nonspecific interactions.

To evaluate the selectivity of this
interaction, the ATIP clone was tested in the two-hybrid
system with the C-terminal ends of the following
receptors: human β2-adrenergic, rat AT1 and human
bradykinin; all gave negative results. This indicates
that the polypeptide encoded by the ATIP clone
interacts, in a selective manner, with the C-terminal
end of the mouse AT2 receptor.
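
The logic of the two-hybrid readout and of the selectivity panel above can be summarized in a short sketch. It is a toy model only: the interaction table simply records the outcomes reported in this example, and the names used (`reporter_active`, `OBSERVED_INTERACTIONS`) are hypothetical.

```python
# Outcomes reported in Example 1 for the ATIP clone tested against
# the C-terminal end of each receptor used as bait.
OBSERVED_INTERACTIONS = {
    "mouse AT2 receptor": True,
    "human beta2-adrenergic receptor": False,
    "rat AT1 receptor": False,
    "human bradykinin receptor": False,
}

def reporter_active(has_dna_binding_fusion: bool,
                    has_activation_fusion: bool,
                    x_binds_y: bool) -> bool:
    """A functional transcription factor is reconstituted only if both
    fusion proteins are present and their X and Y moieties interact."""
    return has_dna_binding_fusion and has_activation_fusion and x_binds_y

positive_baits = [bait for bait, binds in OBSERVED_INTERACTIONS.items()
                  if reporter_active(True, True, binds)]
print(positive_baits)  # ['mouse AT2 receptor']
```

The model makes the selection principle explicit: a yeast clone grows on selective medium only when both hybrids are expressed and the bait and prey moieties bind each other.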

EXAMPLE 2: Characterization of the ATIP clone

To search for the corresponding whole clone, a
probe of 354 bp (SEQ ID NO:5), which corresponds to the
insert obtained by digestion with the restriction
enzyme NotI of the plasmid isolated in the two-hybrid
system (that extracted from the VP16 library, selected
as being positive in the screen using, as bait, the
C-terminal end of the mouse AT2 receptor), is used to
screen a mouse foetal cDNA library constructed with
inserts of more than 1 kb in size. Two overlapping
clones, comprising the ATIP sequence, were thus
identified and made it possible to sequence 1803 bp of
the corresponding cDNA (SEQ ID NO:1). This sequence
contains an open reading frame of 1323 bp (SEQ ID
NO:3), potentially encoding a protein of 440 amino
acids (SEQ ID NO:2 and 4). Comparisons between the
identified protein sequence and the sequences contained
in data banks indicate that it never exhibits more than
25% homology with part of a known sequence.

The 354 bp fragment (SEQ ID NO:5) was used as a
probe in Southern and Northern blots in a very
satisfactory manner under the hybridization conditions
below: prehybridization and hybridization in 45%
formamide, 9% dextran sulphate, 0.2% BSA, 0.2%
polyvinylpyrrolidone, 0.2% Ficoll, 0.1% sodium
pyrophosphate, 0.01% SDS, 0.05 mM Tris pH 7.5, 0.9 M
NaCl and rinses to stringency: 1 x SSC, 0.1% SDS.

In parallel, Northern blot hybridization
experiments carried out on total RNAs of N1E-115 cells
with the ATIP probe (SEQ ID NO:5) confirm the
expression of the corresponding mRNA in the N1E-115
cells, and indicate the existence of at least 5
transcripts of different sizes. These transcripts
correspond to alternative splicings of the same gene or
to different homologous genes.

On a Northern blot, performed under the conditions
described in the literature on a 5 µg sample of poly A+
RNA of N1E-115 cells, the sizes of the various
transcripts hybridizing with the ATIPmouse probe are
approximately 2.5, 3.5, 5, 5.3 and 7.5 kb.

Figure 9 represents a Northern blot containing
poly A+ RNAs of various human tissues, hybridized with
the same ATIPmouse probe. It is possible to observe
that ATIP is ubiquitously expressed. A predominant
transcript at 4.4 kb is found in all the tissues
represented, to which there are added, according to the
tissues, other longer transcripts (pancreas and heart)
or shorter transcripts (pancreas, skeletal muscle,
placenta, brain and heart). These may result from
alternative splicing of the ATIP RNA which would
be dependent on the tissue considered, or alternatively
they may indicate the existence of an RNA family
encoding proteins of the "ATIP family" homologous to
ATIP and which are revealed by the probe at the
stringency used.

To determine the size of the smallest transcript
encoding ATIP, a rapid amplification of the cDNA ends
(5' RACE, Marathon cDNA Amplification Kit from
Clontech) from poly A+ RNA of N1E-115 cells was carried
out using the antisense oligonucleotide of SEQ ID
NO:10, to amplify the 5' parts of the various mRNAs
corresponding to the endogenous ATIP of the N1E-115
cells (murine neuroblastoma).

The results obtained indicated that the
smallest transcript including the ATIP domain is an
mRNA of 1950 bp, which indeed contains the start of the
coding sequence obtained by cloning.

Any other pair of oligonucleotides (primers) of
more than 20 bp and comprising part of the ATIP
sequence may also be used to amplify, by PCR (PCR
conditions to be determined for each pair of
oligonucleotides with the aid of the OLIGO 4 software),
part of the ATIP (and to give a DNA fragment which may
be optionally used as a probe to recognize the DNA or
the RNA corresponding to the ATIP).

EXAMPLE 3: Construction of various vectors according to
the invention

In general, the vectors containing
ATIPmouse-short (with the exception of
pRSETA-ATIPmouse-short) were obtained from an insert
produced by PCR with the following two oligonucleotides
(SEQ ID NO:11 and SEQ ID NO:12):

sense oligo: 5' CGCGGATCCCAGACAGACCGGACGGAACTGGAG 3'

antisense oligo: 5' CCGGAATTCACTACAACCTTTCGTTTAAAGCATC 3',

using as template the vector VP16-ATIPmouse-short
(Figure 5). For the sake of convenience, this insert
is called B-ATIPc-stop-E. Indeed, digested with BamHI
and EcoRI, it gives an insert corresponding to the
sequence

1st strand: GATCC-SEQ ID NO:5 (minus CAT)-TAGTG
2nd strand: CCTAG                        CTTAAG
                                         [STOP]
BamHI site                               EcoRI site
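
As a cross-check of the primer design, the sketch below locates the BamHI and EcoRI recognition sites in the two oligonucleotides as printed above, and shows that the antisense primer, read on the coding strand, carries TAG followed by the EcoRI site, which is consistent with the TAGTG end of the first strand. The recognition sequences GGATCC (BamHI) and GAATTC (EcoRI) are standard; the helper name is illustrative, and whether the TAG triplet is in frame is taken from the source's STOP label, not computed here.

```python
SENSE = "CGCGGATCCCAGACAGACCGGACGGAACTGGAG"       # SEQ ID NO:11 as printed
ANTISENSE = "CCGGAATTCACTACAACCTTTCGTTTAAAGCATC"  # SEQ ID NO:12 as printed

BAMHI_SITE = "GGATCC"
ECORI_SITE = "GAATTC"

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

bamhi_pos = SENSE.find(BAMHI_SITE)           # 0-based offset in the sense primer
ecori_pos = ANTISENSE.find(ECORI_SITE)       # 0-based offset in the antisense primer
coding_view = reverse_complement(ANTISENSE)  # antisense primer read on the coding strand

print(bamhi_pos, ecori_pos)  # 3 3
print(coding_view)           # GATGCTTTAAACGAAAGGTTGTAGTGAATTCCGG
```

Both sites sit three nucleotides from the 5' end of their primer, and the coding-strand view ends in TAGTGAATTC followed by the three extra nucleotides.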

Other vectors may also be constructed; they
encode all or part of the ATIP protein and are the
following:

- VP16-ATIPmouse-short (vector taken from the
library screened in the two-hybrid system; comprises
354 bp (SEQ ID NO:5), inserted at NotI into VP16),

- pCDNA3-MYC-ATIPmouse-short (insert B-ATIPc-stop-E,
inserted at BamHI-EcoRI into pCDNA3-MYC (pcDNA3 from
Invitrogen, modified by insertion of the MYC sequence,
Figure 7)); this plasmid may be used in stable or
transient transfections. It makes it possible to
express MYC-ATIPmouse-short in eukaryotic cells. The
expression of this protein in eukaryotic cells after
transfection of the corresponding plasmid has already
been obtained and checked by immunoreaction with
anti-MYC and anti-ATIP antibodies.

- pRSETA-HIS-ATIPmouse-short (insert B-ATIPc-stop-E,
inserted at BamHI-EcoRI into pRSETA, Invitrogen). This
plasmid makes it possible to express the fusion protein
HIS-ATIPmouse-short in bacterial cells and to purify it
on a nickel column (see Figure 6 for the multiple
cloning site).

- pBacPAK-polyHIS-ATIPmouse-short (insert
B-ATIPc-stop-E, inserted at BamHI-EcoRI into the vector
pBacPAK-polyHIS (commercial pBacPAK, modified by
insertion of a sequence containing a histidine tag and
a site for cleavage with thrombin, Figure 8)). This
construct may be used to express the ATIPmouse-short
protein, fused with a histidine tag, in insect cells
(SF9 type). Indeed, as indicated, this vector contains
a poly-histidine insert and can therefore encode the
fusion protein. The latter, like the fusion protein
cloned into pRSET, may be purified on a nickel column
and may serve in the same type of techniques.

- pGEX-4T1-GST-ATIPmouse-short (insert amplified
by PCR, identical to B-ATIPc-stop-E but with no STOP
codon, which extends the ATIPmouse-short sequence by
the following few amino acids: Phe-Glu-Phe-Pro-Gly-
Arg-Leu-Glu-Arg-Pro-His-Arg-Asp, derived from the
plasmid pGEX-4T-1 (Pharmacia)). This plasmid makes it
possible to express the protein GST-ATIPmouse-short in
bacterial cells and to purify it on glutathione-agarose
beads.

-pCDNAI-ATIPmouse clone 1 (entire sequenced 5'
region of ATIP and ORF up to bp 1205 starting from the
beginning of the clone, inserted in BstXI into pCDNAI).
This plasmid is derived from the cloning of the mouse
foetal library with the probe SEQ ID NO:5. This plasmid
can serve to produce, in bacteria, the 5' portion of
the ATIPmouse DNA, so as to use it as a probe.

-pCDNAI-ATIPmouse clone 2 (2nd half of the ORF
of ATIP, from bp 616 up to the end of the sequenced
3' region (bp 1803), inserted in BstXI into pcDNAI,
Invitrogen). This plasmid can serve to produce, in
bacteria, the 3' portion of the ATIPmouse DNA, so as to
use it as a probe.

-pCDNAI-ATIPmouse-long (clones 1 and 2 placed
end to end, using the intermediate SapI site. This
plasmid contains the entire ATIPmouse clone, inserted
in BstXI into pCDNAI). This plasmid may be used in
transient transfections in eukaryotic cells.

-pCDNA3-ATIPmouse-long (whole ATIPmouse from
BamHI-XbaI of pCDNAI-ATIPmouse-long, inserted into
pcDNA3, Invitrogen, at these same sites). This plasmid
may be used in stable or transient transfections in
eukaryotic cells. It made it possible to translate in
vitro (TNT T7 coupled reticulocyte lysate system kit,
Promega) the whole ATIP protein and to observe that its
translation product has an apparent molecular weight
on gel of 58 kDa. Added to this predominant product are
two minor products of 30 and 15 kDa. According to the
ATIP sequence, these could correspond to partial
products of translation in vitro starting with ATGs
other than that at position 178 of SEQ ID NO:1.
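The relationship between clone 1 and clone 2 can be checked with a little interval arithmetic. The sketch below is illustrative only: it assumes that both clones are numbered in the same coordinate system and that clone 1 starts at bp 1, which the text suggests but does not state outright. The function name `overlap` is a hypothetical helper.

```python
def overlap(a, b):
    """Return the inclusive overlap of two 1-based inclusive intervals, or None."""
    start, end = max(a[0], b[0]), min(a[1], b[1])
    return (start, end) if start <= end else None


# Coordinates quoted in the vector list (clone 1 start at bp 1 is an assumption).
clone1 = (1, 1205)     # sequenced 5' region and ORF up to bp 1205
clone2 = (616, 1803)   # second half of the ORF up to the sequenced 3' end

shared = overlap(clone1, clone2)
shared_length = shared[1] - shared[0] + 1  # inclusive length in bp

print(f"shared region: bp {shared[0]}-{shared[1]} ({shared_length} bp)")
```

Under these assumptions the two clones share bp 616 to 1205, which is the region where a junction site such as the SapI site used for pCDNAI-ATIPmouse-long would be expected to lie.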
EXAMPLE 4: Production of stable clones expressing the
ATIPmouse-short or long protein

Stable clones expressing both the human AT2
receptor and ATIPmouse-short (SEQ ID NO:6) or
ATIPmouse-long (SEQ ID NO:3) were obtained by
transfection.

CHO cells, deficient in dihydrofolate
reductase, are transfected with a plasmid containing
the region encoding the human AT2 receptor (Bedecs et
al., Biochem. J., 1997, 325, 449-454).

The clone selected, CHO-hAT2, expressing 100
fmol of AT2 receptor/mg of protein, is cultured in
HAM F12 medium supplemented with 10% foetal calf serum
and used between passages 10 and 30.

This clone was itself transfected with the
plasmid pCDNA3-MYC-ATIPmouse-short or
pCDNA3-ATIPmouse-long described in Example 3. The
clones stably expressing the ATIP protein (short
form or long form) were selected in a selective
medium containing 800 µg/ml of G418. The cell lysates
corresponding to the various selected clones were
subjected to SDS-PAGE followed by immunoblotting, and
the blot was incubated with the anti-ATIP polyclonal
antibody. The results obtained indicate that various
clones expressing various levels of ATIPmouse-short
were obtained.

EXAMPLE 5: Production of polyclonal antibodies directed
against the SEQ ID NO:6 sequence

To progress in the characterization of this
clone, the production of polyclonal antibodies directed
against the ATIP domain was undertaken.

For that, a vector encoding a protein
corresponding to this domain fused with six histidine
residues was constructed.

The following sequence:

GGA TCC-SEQ ID NO:5-TAG-TGA-ATT

is inserted into the plasmid pRSETA, as defined above.

In this insert, SEQ ID NO:5 does not comprise
the first CAT.
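The layout of this insert can be sketched as string assembly. The example below is a hedged illustration, not the authors' cloning procedure: `build_insert` is a hypothetical helper, the test sequence is a made-up placeholder rather than the real SEQ ID NO:5, and the final C that completes the EcoRI site (GAATTC) is inferred from the BamHI-EcoRI cloning described for this insert.

```python
BAMHI_SITE = "GGATCC"
ECORI_SITE = "GAATTC"


def build_insert(seq_id_no_5: str) -> str:
    """Assemble BamHI site + coding sequence (minus leading CAT) + TAG + TGAATTC."""
    seq = seq_id_no_5.upper()
    if not seq.startswith("CAT"):
        raise ValueError("expected the sequence to begin with CAT")
    coding = seq[3:]  # drop the first CAT, as stated in the text
    if len(coding) % 3 != 0:
        raise ValueError("coding part is not a whole number of codons")
    # "TAG" is the first stop codon; "T" + GAATTC gives a second stop (TGA)
    # that overlaps the EcoRI site.
    return BAMHI_SITE + coding + "TAG" + "T" + ECORI_SITE


# Placeholder sequence for illustration only (NOT the real SEQ ID NO:5).
placeholder = "CAT" + "GCTGAA" * 3
insert = build_insert(placeholder)

# Split into codons in the frame set by the BamHI site (GGA TCC ...).
codons = [insert[i:i + 3] for i in range(0, len(insert) - 2, 3)]
print(insert)
print(codons)
```

Reading from the BamHI site, the coding part stays in frame and is followed directly by the two stop codons TAG and TGA.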

The plasmid obtained is expressed in the E.
coli strain BL21(DE3) (F- ompT- rB- mB-) containing the
bacteriophage DE3, which carries a DNA fragment
containing the lacI gene, the lacUV5 promoter, the
start of the lacZ gene and the gene encoding T7 RNA
polymerase. This fragment is introduced into the int
gene.

In the presence of DE3, only the lacUV5
promoter, inducible by IPTG, directs the transcription
of T7 RNA polymerase.

The addition of 0.4 mM IPTG to a culture of
BL21(DE3) cells induces the production of T7 RNA
polymerase which, in turn, causes the transcription of
the target DNA of the plasmid pRSETA (which allows the
translation of the protein binding to the AT2
receptor).

The protein obtained (17 kDa) is purified on a
nickel column (Ni-NTA, QIAexpressionist 07/97,
Qiagen), by virtue of the affinity of its six
histidine residues for nickel. The protein obtained is
then injected into rabbits so as to obtain polyclonal
antibodies directed against the ATIP protein. The
bleedings obtained have a very good titre.
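As a rough plausibility check on the 17 kDa figure, the mass of the fusion protein can be estimated from its length. This is only an order-of-magnitude sketch: the average of about 110 Da per residue is a common rule of thumb, and the length of the vector-encoded N-terminal tag (`tag_residues`) is a hypothetical value chosen for illustration, not one given in the text.

```python
AVG_RESIDUE_DA = 110.0  # rule-of-thumb average mass of one amino acid residue


def estimate_mass_kda(n_residues: int) -> float:
    """Very rough protein mass estimate from residue count."""
    return n_residues * AVG_RESIDUE_DA / 1000.0


insert_bp = 354 - 3              # SEQ ID NO:5 (354 bp) without the first CAT
atip_residues = insert_bp // 3   # codons contributed by the ATIP fragment
tag_residues = 36                # hypothetical length of the His-tag leader

atip_kda = estimate_mass_kda(atip_residues)
fusion_kda = estimate_mass_kda(atip_residues + tag_residues)

print(f"ATIP fragment: {atip_residues} residues, about {atip_kda:.1f} kDa")
print(f"with assumed tag: about {fusion_kda:.1f} kDa")
```

With these assumptions the estimate lands close to the reported 17 kDa, which suggests that the size observed is consistent with the short ATIP domain plus a vector-encoded leader.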

These antibodies, purified on a GST-ATIP
column after passing through a column of GST alone (so as
to remove possible GST-specific antibodies and to
retain on the GST-ATIP column only the antibodies
specific for ATIPmouse-short), may be used successfully
to immunoprecipitate and reveal in immunoblotting
MYC-ATIPmouse-short from transiently transfected COS cells.
Furthermore, this purified antibody also reveals in
immunoblotting the ATIPmouse-long protein contained in
lysates of COS cells transiently transfected with the
plasmid pCDNA3-ATIPmouse-long.
The transfected ATIPmouse-long protein is
visualized after SDS-PAGE and immunoblotting with an
anti-ATIP antibody, in the form of two polypeptides
having apparent molecular weights of 50 and 45 kDa.
This purified antibody was used in
immunofluorescence on CHO-hAT2 cells, fixed by a
15-minute treatment with paraformaldehyde (3%). After
fixing, the cells are successively treated with
solutions of PBS/glycine 50 mM for 20 minutes,
PBS/Triton X100 0.1% for 5 minutes and PBS/BSA 0.2% for
15 minutes. They are then successively incubated in
solutions containing 15 µg/ml of the purified
anti-ATIP antibody, and then the anti-rabbit
immunoglobulin antibody coupled to rhodamine, for
30 minutes. Between incubations, three rinses
in PBS are carried out. Observations under a
fluorescence microscope indicate an expression of the
endogenous ATIP protein in the nucleus (predominantly)
and in the cytoplasm of the CHO-hAT2 cells.

Some cells show a homogeneous distribution of
the fluorescence due to the anti-ATIP antibody in these
compartments, whereas other cells, which appear more
spread out, show a heterogeneous distribution of the
fluorescence along filaments which appear to start
from the nucleus and spread up to the plasma membrane
of the cell, in an organized network. Additional
colocalization experiments should be carried out to
determine whether or not these filaments coincide with
known structures of the cytoskeleton.

EXAMPLE 6: Confirmation of the in vitro interaction of
the ATIPmouse-short protein with the C-terminal end of
the AT2 receptor

To demonstrate the interaction of the
ATIPmouse-short protein with the C-terminal end of the
AT2 receptor in a system other than the
two-hybrid system, a protocol which makes it possible to
demonstrate this interaction in vitro was set up. For
that, the fusion protein GST-ATIP as described above
was produced; it is combined through its GST part with
glutathione coupled to agarose beads (GA). In parallel,
bacteria (DH5α) are transformed with a plasmid
(pMAL-c2-AT2, derived from pMAL-c2 from New England Biolabs)
encoding a fusion protein between the C-terminal end of
the human AT2 receptor (Asn314-Ser363) and MBP (Maltose
Binding Protein). These bacteria were cultured and the
fusion protein was induced in 0.3 mM IPTG according to
the protocol "pMAL Protein Fusion and Purification
System" from New England Biolabs. After centrifugation
of the culture at 4 000 g and solubilization of the
pellet obtained in "column buffer" (20 mM Tris-HCl, 200
mM NaCl, 1 mM EDTA), another centrifugation at 9 000 g
made it possible to recover a supernatant containing a
high concentration of MBP-AT2. This supernatant was
brought into contact, for 3 hours at 4°C, with
glutathione agarose beads coupled to GST protein alone,
after addition of NaCl so as to have 300 mM final NaCl.
This preincubation step makes it possible to remove the
nonspecific interactions which may exist between ATIP
and GA-GST. The supernatant recovered was brought into
contact with the GA-GST-ATIPmouse-short or GA-GST alone
beads overnight at 4°C. After contact, the beads were
rinsed three times in 20 mM Tris-HCl buffer, 300 mM
NaCl, 1 mM EDTA and once in "column buffer". After
analysis of the rinsed beads by SDS-PAGE and
immunoblotting with an anti-MBP antibody (New England
Biolabs), a specific retention of the MBP-AT2 protein
is observed on the GA-GST-ATIPmouse-short beads which is
not observed on the GA-GST alone beads (Figure 10).
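The salt adjustment in this protocol (from the 200 mM NaCl of the column buffer to 300 mM final NaCl) is a simple mixing calculation. The sketch below shows one way to work it out; the supernatant volume (10 ml) and the 5 M NaCl stock are assumptions made for the example and are not given in the text, and `stock_volume_to_add` is a hypothetical helper name.

```python
def stock_volume_to_add(v_initial_ml: float, c_initial_mm: float,
                        c_final_mm: float, c_stock_mm: float) -> float:
    """Volume of stock (ml) that raises a solution from c_initial to c_final.

    Derived from the mass balance:
        c_initial * V + c_stock * v = c_final * (V + v)
    """
    if not c_initial_mm < c_final_mm < c_stock_mm:
        raise ValueError("need c_initial < c_final < c_stock")
    return v_initial_ml * (c_final_mm - c_initial_mm) / (c_stock_mm - c_final_mm)


supernatant_ml = 10.0    # assumed volume, for illustration only
nacl_stock_mm = 5000.0   # assumed 5 M NaCl stock

added_ml = stock_volume_to_add(supernatant_ml, 200.0, 300.0, nacl_stock_mm)
final_mm = (supernatant_ml * 200.0 + added_ml * nacl_stock_mm) / (supernatant_ml + added_ml)

print(f"add {added_ml * 1000:.0f} µl of stock; final NaCl = {final_mm:.0f} mM")
```

The formula accounts for the dilution caused by the added stock itself, which matters when the stock is not much more concentrated than the target.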

This same protocol was carried out with a
plasmid expressing MBP-AT1 (C-terminal end of the human AT1
receptor (Leu297-Glu359)); it indicates that the
MBP-AT1 protein is not retained in a specific manner on the
GA-GST-ATIPmouse-short beads (Figure 10).

These results confirm those obtained in the
two-hybrid system indicating a specific and selective
interaction between the protein according to the
invention and the C-terminal end of the AT2 receptor
(and not AT1).

EXAMPLE 7: Modification of the transduction of the
signal for the AT2 receptor in clones overexpressing
the ATIPmouse-long protein

To verify that the ATIP protein interacts in
vivo with the AT2 receptor, it was evaluated whether an
overexpression of this protein modifies a signal
induced by the AT2 receptor.

For that, a stable clone of CHO-hAT2 cells
expressing the ATIPmouse-long protein (CHO-hAT2-ATIP),
obtained according to the methodology described in
Example 4, was used; the functional test for the
activity of the AT2 receptor developed on the CHO-hAT2
clone, which consists in inhibiting the phosphorylation
of the IRβ subunit of the insulin receptor induced by
its ligand, was reproduced.

Demonstration of an inhibition by the AT2
receptor of the phosphorylation of IRβ induced by
insulin in CHO-hAT2 cells:

The CHO-hAT2 cells are inoculated at a density
of 3×10^6 cells per dish 15 cm in diameter.
They are made quiescent by 16 hours of deprivation
before being treated. The treatment consists in
bringing into contact for 5 minutes with 15 ml of F12
medium containing insulin supplemented or otherwise
with CGP42112 (selective agonist of the AT2 receptor).
After treatment, the cells are solubilized in lysis
buffer containing: 50 mM Hepes, pH 7.6, 1% Triton
X-100, 20 mM EDTA, 30 mM sodium pyrophosphate, 30 mM
sodium fluoride, 2 mM benzamidine, 1 mM sodium
orthovanadate, 1 mM phenylmethylsulphonyl fluoride and
1 µg/ml of aprotinin, pepstatin, antipain and
leupeptin. The lysates are then subjected to
purification on a wheatgerm lectin column, according to
the protocol described in Issad, T. et al. (Eur. J.
Biochem., 1995, 234, 108-115). After bringing into
contact and washings, the lectin beads coupled to
Sepharose (Pharmacia) are recovered in sample buffer
containing SDS and the eluted proteins are analysed by
SDS-PAGE followed by immunoblotting with
anti-phosphotyrosine antibodies (Upstate Biotechnology,
Inc.) or anti-IRβ antibodies (described in Issad, T. et
al., cited above).

The β subunit of the insulin receptor appears
as a polypeptide of 97 kDa whose phosphorylation
(visualized by revealing with an anti-phosphotyrosine
antibody) increases in a dose-dependent manner with the
concentration of insulin. Angiotensin II (100 nM) as
well as CGP42112 (100 nM) inhibit this phosphorylation
at all the insulin doses tested between 0.1 and 0.001
µg/ml (Figure 11). By way of example, CGP42112 inhibits
the phosphorylation of IRβ induced by 0.01 µg/ml of
insulin by 64 ± 4% (n=7). This result demonstrates that
the AT2 receptor interferes negatively with the
signalling pathways for the insulin receptor at the
initial stage of its activation, which is its
autophosphorylation. These results also provide the
first evidence of an interconnection between the
signalling pathways for the tyrosine kinase receptors
and the receptor with seven transmembrane domains which
is AT2.
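A figure such as "64 ± 4% (n=7)" is typically derived from band intensities measured in paired conditions. The sketch below shows one common way of computing percent inhibition and summarizing replicates; the densitometry values are invented placeholders, not data from this document, and the text does not say whether its "±" denotes a standard error or a standard deviation (the standard error is used here as an assumption).

```python
from math import sqrt
from statistics import mean, stdev


def percent_inhibition(insulin_only: float, insulin_plus_agonist: float) -> float:
    """Percent decrease of the signal when the agonist is added."""
    return 100.0 * (1.0 - insulin_plus_agonist / insulin_only)


def mean_and_sem(values):
    """Mean and standard error of the mean for a list of replicates."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical band intensities (arbitrary units): (insulin, insulin + CGP42112).
replicates = [(100.0, 40.0), (80.0, 28.0), (120.0, 42.0)]

inhibitions = [percent_inhibition(a, b) for a, b in replicates]
avg, sem = mean_and_sem(inhibitions)

print(f"inhibition = {avg:.1f} ± {sem:.1f}% (n={len(inhibitions)})")
```

Each replicate is normalized to its own insulin-only control before averaging, so differences in exposure between blots do not distort the result.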

Reproduction of this methodology on CHO-hAT2-ATIP cells:

When this protocol is carried out on
CHO-hAT2-ATIP cells, the inhibition by CGP42112 (100 nM) of the
phosphorylation of the insulin receptor obtained for
various doses of insulin (0.05, 0.01, 0.005, 0.001
µg/ml) is not observed (Figure 11). This result was
reproduced 3 times for each of the insulin doses,
taking, as positive control in each experiment, the
inhibition obtained for the clone CHO-hAT2.

This therefore demonstrates that the
overexpression of the ATIP protein in the CHO-hAT2
cells interferes with the AT2 receptor signalling,
which confirms the interaction in vivo of the ATIP
protein with the AT2 receptor.
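The insulin doses in this example are given in µg/ml while the agonists are given in nM, so a unit conversion helps when comparing them. The sketch below assumes a molecular weight of about 5808 g/mol (human insulin); the text does not specify which insulin was used, so this value is an assumption, and `ug_per_ml_to_nm` is a hypothetical helper.

```python
INSULIN_MW_G_PER_MOL = 5808.0  # assumed: approximate mass of human insulin


def ug_per_ml_to_nm(conc_ug_per_ml: float, mw_g_per_mol: float) -> float:
    """Convert a mass concentration in µg/ml to a molar concentration in nM."""
    grams_per_litre = conc_ug_per_ml * 1e-3   # 1 µg/ml equals 1 mg/l
    mol_per_litre = grams_per_litre / mw_g_per_mol
    return mol_per_litre * 1e9                # mol/l to nmol/l


# Insulin doses quoted in the example, in µg/ml.
doses = [0.1, 0.05, 0.01, 0.005, 0.001]
for dose in doses:
    print(f"{dose} µg/ml is about {ug_per_ml_to_nm(dose, INSULIN_MW_G_PER_MOL):.2f} nM")
```

On this assumption the tested range corresponds to roughly 0.2 to 17 nM insulin, that is, below the 100 nM used for angiotensin II and CGP42112.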

Another glycosylated protein, retained on a
lectin column, having an apparent weight of 120 kDa,
identified as being the newly cloned protein SIRP
(Kharitonenkov, A. et al., Nature, 1997, 386, 181-186),
is phosphorylated on tyrosine in response to insulin.
The phosphorylation of this protein, as well as that of
IRβ, is inhibited in the presence of CGP42112 in the
case of the clone CHO-hAT2 and is not in the case of
the clone CHO-hAT2-ATIP. This confirms that the ATIP
protein interferes with the signalling pathways for the
AT2 receptor. These results indeed show the possible
value of the use of the ATIP protein for modifying
signalling mediated by the AT2 receptor and for
possibly compensating for pathological conditions
associated with abnormalities in the regulation of this
receptor.

As is evident from the above, the invention is
not at all limited to those of its embodiments,
implementations and applications which have just been
described more explicitly; it encompasses, on the
contrary, all the variants thereof which may occur to
the specialist in the field, without departing from the
framework or the scope of the present invention.

CLAIMS

1. Isolated nucleic acid fragment, encoding a
protein capable of binding to the AT2 receptor, which
fragment is selected from the group consisting of the
sequences SEQ ID NO:1, 3, 5, 7 and 9.

2. Fragment of one of the sequences according to
Claim 1, comprising between 20 and 400 bp, useful as
probes or as primers, for the detection of the
sequences SEQ ID NO:1, 3, 5, 7 or 9, or of homologous
sequences.

3. Fragment according to Claim 2, characterized in
that it comprises from 20 bp to 400 bp included in the
sequences SEQ ID NO:1, 3, 5, 7 or 9.

4. Fragment according to Claim 2 or Claim 3,
characterized in that it is selected from the group
consisting of the sequences SEQ ID NO:5, SEQ ID NO:10,
SEQ ID NO:11 and SEQ ID NO:12.

5. Transcripts, characterized in that they are
complementary to the sequences according to Claim 1.

6. Purified and isolated protein, which is capable
of interacting with the AT2 receptor and which is
selected from the group consisting of the sequences SEQ
ID NO:2, 4, 6 or 8, which protein is called ATIP.

7. Translational product, characterized in that it
is encoded by a nucleotide sequence according to Claim
1.

8. Antibodies, characterized in that they are
directed against a protein or a protein fragment
according to Claim 6 or Claim 7.

9. Recombinant cloning and/or expression vector,
characterized in that it comprises a nucleotide
sequence according to Claim 1.

10. Transformed host cell, characterized in that it
comprises a vector according to Claim 9.

11. Transformed host cells, characterized in that
they consist of a suitable yeast strain cotransformed
with at least two vectors which respectively encode (i)
a so-called bait protein selected from the group
consisting of a fragment containing at least SEQ ID
NO:5 of the ATIP protein according to Claim 6 or Claim
7, and a fragment containing at least the C-terminal
end of the AT2 receptor, which bait protein is fused
with a protein selected from the group consisting of
the DNA-binding domain of a transcription factor and
the activation domain of the same transcription factor
and (ii) a so-called prey protein, selected from the
group consisting of a fragment containing at least SEQ
ID NO:5 of the ATIP protein according to Claim 6 or
Claim 7, a fragment containing at least the C-terminal
end of the AT2 receptor and any other polypeptide
corresponding to a sequence contained in a cDNA
library, which prey protein is fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the same transcription factor, which vectors
comprise, in addition, selectable markers.

12. Transformed host cell according to Claim 11,
characterized in that it consists of a suitable yeast
strain cotransformed with three vectors which
respectively encode (i) a bait corresponding to a
fragment containing the C-terminal end of the AT2
receptor fused with a protein selected from the group
consisting of the DNA-binding domain of a transcription
factor and the activation domain of the said
transcription factor, (ii) a fragment containing at
least SEQ ID NO:5 of the ATIP protein according to
Claim 6 or Claim 7, fused with a protein selected from
the group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (iii) a polypeptide
corresponding to a sequence contained in a cDNA
library, which vectors comprise, in addition,
selectable markers.

13. Transformed host cell according to Claim 11,
characterized in that it consists of a suitable yeast
strain cotransformed with two vectors which
respectively encode (i) a fragment containing at least
the sequence SEQ ID NO:5 of the ATIP protein according
to Claim 6 or Claim 7, fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor and (ii) a polypeptide
corresponding to a sequence contained in a cDNA
library, fused with a protein selected from the group
consisting of the DNA-binding domain of a transcription
factor and the activation domain of the said
transcription factor, which vectors comprise, in
addition, selectable markers.

14. Transformed host cell according to Claim 11,
characterized in that it consists of a suitable yeast
strain cotransformed with two vectors, namely (i) a
vector encoding a fragment containing at least the SEQ
ID NO:5 of the ATIP protein sequence according to Claim
6, mutated or not, fused with a protein selected from
the group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (ii) a vector encoding a
fragment containing the C-terminal end of the AT2
receptor, mutated or not, fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor, which vectors comprise, in
addition, selectable markers, one of the two vectors
necessarily encoding a mutated protein.

15. Method for selecting proteins inhibiting the
interaction between the ATIP protein according to Claim 6
or Claim 7 and the AT2 receptor, which method comprises:

(a) cotransforming a suitable yeast strain with
three vectors which respectively encode (i) a bait
corresponding to a fragment containing the C-terminal
end of the AT2 receptor fused with a protein selected
from the group consisting of the DNA-binding domain of
a transcription factor and the activation domain of the
said transcription factor, (ii) a fragment containing
at least SEQ ID NO:5 of the ATIP protein according to
Claim 6 or Claim 7, fused with a protein selected from
the group consisting of the DNA-binding domain of a
transcription factor and the activation domain of the
said transcription factor and (iii) a polypeptide
corresponding to a sequence contained in a cDNA
library, which vectors comprise, in addition,
selectable markers,

(b) selecting the clones of the cDNA library
expressing a polypeptide inhibiting the interaction
between the AT2 receptor and the ATIP protein according
to Claim 6 or Claim 7, on an appropriate selective
medium, and

(c) identifying the said polypeptide.

16. Method for screening polypeptides interacting
with the ATIP protein according to Claim 6 or Claim 7,
which method comprises:

(a) cotransforming a suitable yeast strain with
two vectors as defined above, which respectively
encode (i) a fragment containing at least SEQ ID NO:5
of the ATIP protein according to Claim 6 or Claim 7,
fused with a protein selected from the group consisting
of the DNA-binding domain of a transcription factor and
the activation domain of the said transcription factor
and (ii) a polypeptide corresponding to a sequence
contained in a cDNA library, fused with a protein
selected from the group consisting of the DNA-binding
domain of a transcription factor and the activation
domain of the said transcription factor, which vectors
comprise, in addition, selectable markers, and

(b) selecting the clones expressing a
polypeptide interacting with the ATIP protein according
to Claim 6 or Claim 7, on a suitable selective medium.
17. Method for characterizing the domains involved
in the ATIP protein-AT2 receptor interaction,
characterized in that it comprises:

(a) cotransforming a suitable yeast strain with
two vectors, namely (i) a vector encoding a fragment
containing at least SEQ ID NO:5 of the ATIP protein
according to Claim 6, mutated or not, fused with a
protein selected from the group consisting of the
DNA-binding domain of a transcription factor and the
activation domain of the said transcription factor and
(ii) a vector encoding a fragment containing the
C-terminal end of the AT2 receptor, mutated or not, fused
with a protein selected from the group consisting of
the DNA-binding domain of a transcription factor and
the activation domain of the said transcription factor,
which vectors comprise, in addition, selectable
markers, one of the two vectors necessarily encoding a
mutated protein, and

(b) visualizing, by selection on a suitable
selective medium, the possible loss of the interaction
between the ATIP protein according to Claim 6 or Claim 7
and the AT2 receptor.

18. Method for selecting substances capable of
influencing the interaction between the ATIP protein
according to Claim 6 or Claim 7 and the AT2 receptor,
which method comprises:

(a) bringing the ATIP protein according to
Claim 6 or Claim 7, attached to a support, into contact
with a fusion protein AT2 receptor-protein tag,
optionally in the presence of a substance to be tested,

(b) at least one washing of the said support
thus treated with a suitable buffer, and

(c) visualizing the possible interaction between the
ATIP protein according to Claim 6 or Claim 7 and the AT2
receptor, in particular by SDS-PAGE, followed by
immunoblotting with antibodies directed against the
protein tag fused with the AT2 receptor.

19. Method for selecting substances capable of
interacting with the ATIP protein according to Claim 6
or Claim 7, characterized in that it comprises:

(a) bringing the ATIP protein according to
Claim 6 or Claim 7, attached to a support, into contact
with a cell lysate,

(b) at least one washing of the said support
thus treated with a suitable buffer,

(c) visualizing the possible protein combined
with the ATIP protein, in particular by SDS-PAGE,
followed by immunoblotting with appropriate antibodies,
and

(d) identifying the protein in the cell lysate
interacting with the ATIP protein.

20. Use of the cotransformed cells according to any
one of Claims 10 to 13, for the selection and screening
of substances or of proteins capable of influencing the
ATIP protein-AT2 receptor interaction or capable of
interacting with the ATIP protein.

(54) Title: NUCLEIC SEQUENCES CODING FOR AN AT2 INTERACTING PROTEIN INTERACTING WITH THE AT2 RECEPTOR
AND THEIR APPLICATIONS


(57) Abstract

The invention concerns nucleic sequences coding
for a protein capable of interacting with the AT2
receptor, oligonucleotides included in said sequences,
their applications as probes and for expressing said
proteins, vectors useful for said expression, host cells
containing said vectors, and a model for studying the AT2
receptor. The invention also concerns said proteins and
their uses. Said isolated nucleic acid fragment coding
for a protein capable of binding to the AT2 receptor
is selected from the group consisting of the sequences
SEQ ID NO:1, 3, 5, 7 and 9.



LOCUS      AT2 receptor C-terminal end   160 BP   DS-DNA

ORGANISM   Mouse

BASES      41 A   33 C   36 G   50 T

Nucleic acids
  1 TGTGTTAATC CCTTCCTGTA TTGTTTTGTT GGAAACCGCT
 41 TCCAACAGAA CGTCCGCAGT GTGTTTAGAG TTCCCATTAC
 81 TTGGCTCCAA GGCAAGAGAG AGACTATGTC TTGCAGAAAA
121 GGCAGTTCTC TTAGAGAAAT GGACACCTTT GTGTCTTAAA

Translation into amino acids

CVNPFLYCFV GNRFQQNVRS VFRVPITWLQ GKRETMSCRK
GSSLREMDTFVS*
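The base counts and the translation given for this 160 bp sequence can be re-derived mechanically. The sketch below is a plain-Python check using the standard genetic code; the sequence is copied from the listing above, and the helper name `translate` is hypothetical. Translation stops at the first stop codon, which is written as "*".

```python
from collections import Counter

# 160 bp sequence of the AT2 receptor C-terminal end, as printed in the listing.
SEQUENCE = (
    "TGTGTTAATCCCTTCCTGTATTGTTTTGTTGGAAACCGCT"
    "TCCAACAGAACGTCCGCAGTGTGTTTAGAGTTCCCATTAC"
    "TTGGCTCCAAGGCAAGAGAGAGACTATGTCTTGCAGAAAA"
    "GGCAGTTCTCTTAGAGAAATGGACACCTTTGTGTCTTAAA"
)

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping after the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


composition = Counter(SEQUENCE)
protein = translate(SEQUENCE)

print(dict(composition))
print(protein)
```

Run on the sequence as printed, this reproduces the stated composition (41 A, 33 C, 36 G, 50 T) and the 52-residue translation, including a phenylalanine at the position that is illegible in some copies of the figure.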



[Figure: nucleotide sequence with deduced amino acid sequence, continued; the sequence text is not legible in this copy.]

Declaration and Power of Attorney for Patent Application
As a below named inventor, I hereby declare 
that: 

My residence, post office address and 
citizenship are as stated next to my name. 



I believe I am the original, first and sole
inventor (if only one name is listed below) or
an original, first and joint inventor (if plural
names are listed below) of the subject matter
which is claimed and for which a patent is
sought on the invention entitled



Nucleic sequences coding for an AT2 
interacting protein interacting with the AT2 
receptor and their applications 






the specification of which:

[ ] is attached hereto.

[x] was filed on August 2, 1999 as United States Application
Number or PCT International Application Number
PCT/FR99/01908

and was amended on (if applicable).

I hereby state that I have reviewed and 
understand the contents of the above identified 
specification, including the claims, as amended 
by any amendment referred to above. 



Je reconnais devoir divulguer toute
information pertinente à la brevetabilité,
comme défini dans le Titre 37, § 1.56 du Code
fédéral des réglementations.



I acknowledge the duty to disclose information 
which is material to patentability as defined in 
Title 37, Code of Federal Regulations, § 1.56.



Je revendique par le présent acte avoir la priorité étrangère, en
vertu du Titre 35, § 119(a)-(d) ou § 365(b) du Code des
États-Unis, sur toute demande étrangère de brevet ou certificat
d'inventeur ou, en vertu du Titre 35, § 365(a) du même Code,
sur toute demande internationale PCT désignant au moins un
pays autre que les États-Unis et figurant ci-dessous et, en
cochant la case, j'ai aussi indiqué ci-dessous toute demande
étrangère de brevet, tout certificat d'inventeur ou toute demande
internationale PCT ayant une date de dépôt précédant celle de la
demande à propos de laquelle une priorité est revendiquée.



Prior Foreign Application(s)

Demande(s) de brevet antérieure(s) dans un autre pays.



(Number)
(Numéro)



(Country) 
(Pays) 



I hereby claim foreign priority under Title 35, United States
Code, § 119(a)-(d) or § 365(b) of any foreign application(s) for
patent or inventor's certificate, or § 365(a) of any PCT
International application which designated at least one country
other than the United States, listed below, and have also
identified below, by checking the box, any foreign application
for patent or inventor's certificate, or PCT International
application having a filing date before that of the application on
which priority is claimed.

Priority claimed
Droit de priorité



August 4, 1998

(Day/Month/Year Filed)
(Jour/Mois/Année de dépôt)



Yes 

Oui



□ 

No 
Non 



(Number) 
(Numéro)



(Country) 

(Pays) 



(Day/Month/Year Filed) 
(Jour/Mois/Année de dépôt)



□ 

Yes 

Oui 



□ 

No 
Non 



Je revendique par le présent acte tout bénéfice, en vertu du Titre
35, § 119(e) du Code des États-Unis, de toute demande de brevet
provisoire effectuée aux États-Unis et figurant ci-dessous.



I hereby claim the benefit under Title 35, United States Code, §
119(e) of any United States provisional application(s) listed
below.



(Application No.) 
(N° de demande) 



(Filing Date) 
(Date de dépôt)



Je revendique par le présent acte tout bénéfice, en vertu du Titre
35, § 120 du Code des États-Unis, de toute demande de brevet
effectuée aux États-Unis, ou en vertu du Titre 35, § 365(c) du
même Code, de toute demande internationale PCT désignant les
États-Unis et figurant ci-dessous et, dans la mesure où l'objet de
chacune des revendications de cette demande de brevet n'est pas
divulgué dans la demande antérieure américaine ou
internationale PCT, en vertu des dispositions du premier
paragraphe du Titre 35, § 112 du Code des États-Unis, je
reconnais devoir divulguer toute information pertinente à la
brevetabilité, comme défini dans le Titre 37, § 1.56 du Code
fédéral des réglementations, dont j'ai pu disposer entre la date de
dépôt de la demande antérieure et la date de dépôt de la demande
nationale ou internationale PCT de la présente demande :



(Application No.) 
(N° de demande)



(Filing Date)
(Date de dépôt)



I hereby claim the benefit under Title 35, United States Code, §
120 of any United States application(s), or § 365(c) of any PCT
International application designating the United States, listed
below and, insofar as the subject matter of each of the claims of
this application is not disclosed in the prior United States or PCT
International application in the manner provided by the first
paragraph of Title 35, United States Code, § 112, I acknowledge
the duty to disclose information which is material to
patentability as defined in Title 37, Code of Federal Regulations,
§ 1.56 which became available between the filing date of the
prior application and the national or PCT international filing date
of this application.



(Application No.) 
(N° de demande)



(Filing Date)
(Date de dépôt)



(Status) (patented, pending, abandoned)

(Statut) (breveté, en cours d'examen, abandonné)



(Application No.)
(N° de demande)



(Filing Date)
(Date de dépôt)



Je déclare par le présent acte que toute déclaration ci-incluse
est, à ma connaissance, véridique et que toute déclaration
formulée à partir de renseignements ou de suppositions est tenue
pour véridique ; et de plus, que toutes ces déclarations ont été
formulées en sachant que toute fausse déclaration volontaire ou
son équivalent est passible d'une amende ou d'une incarcération,
ou des deux, en vertu de la section 1001 du Titre 18 du Code des
États-Unis, et que de telles déclarations volontairement fausses
risquent de compromettre la validité de la demande de brevet ou
du brevet délivré à partir de celle-ci.



(Status) (patented, pending, abandoned)

(Statut) (breveté, en cours d'examen, abandonné)

I hereby declare that all statements made herein of my own
knowledge are true and that all statements made on information
and belief are believed to be true; and further that these
statements were made with the knowledge that willful false
statements and the like so made are punishable by fine or
imprisonment, or both, under Section 1001 of Title 18 of the
United States Code and that such willful false statements may
jeopardize the validity of the application or any patent issued
thereon.



POUVOIRS : En tant que l'inventeur cité, je désigne par la
présente l'(les) avocat(s) et/ou agent(s) suivant(s) pour qu'ils
poursuive(nt) la procédure de cette demande de brevet et
traite(nt) toute affaire s'y rapportant avec l'Office des brevets et
des marques : (mentionner le nom et le numéro
d'enregistrement).



POWER OF ATTORNEY: As a named inventor, I hereby
appoint the following attorney(s) and/or agent(s) to prosecute
this application and transact all business in the Patent and
Trademark Office connected therewith: (list name and
registration number)



RAYMOND O. LINKER, JR. Registration No. 26,419



All practitioners associated with 

CUSTOMER NUMBER 000826 



Adresser toute correspondance à :



Send Correspondence to : 

ALSTON & BIRD LLP
Bank of America Plaza

101 South Tryon Street, Suite 4000



Adresser tout appel téléphonique à :
(nom et numéro de téléphone)



Direct Telephone calls to : (name and telephone number) 
(704) 444-1000 






Nom complet de l'unique ou premier inventeur
ELBAZ Nathalie


Full name of sole or first inventor


Signature de l'inventeur Date


Inventor's signature Date 


[illegible] (FRANCE)


Residence 


Nationalité

Française


Citizenship


Adresse Postale

7, Passage des Italiens 93170 BAGNOLET FRANCE


Post Office Address 






Nom complet du second co-inventeur, le cas échéant


Full name of second joint inventor, if any




Second inventor's signature Date 


Domicile
75003 PARIS (FRANCE)


Residence 


Nationalité

Française


Citizenship 


Adresse Postale 

47 Rue B^tJliy 75003 PARIS FRANCE


Post Office Address 







(Fournir les mêmes renseignements et la signature de tout
co-inventeur supplémentaire.)



(Supply similar information and signature for third and
subsequent joint inventors.)



Nom complet du troisième co-inventeur, le cas échéant

STROSBERG Arthur, Donny


Full name of third joint inventor, if any


Signature de l'inventeur Date




Third inventor's signature Date


Domicile
75015 PARIS (FRANCE)


Residence 


Nationalité

Française


Citizenship


Adresse Postale

66, Rue de Javel 75015 PARIS FRANCE


Post Office Address 






Nom complet du quatrième co-inventeur, le cas échéant


Full name of fourth joint inventor, if any


Signature de l'inventeur Date


Fourth inventor's signature Date


Domicile 


Residence 


Nationalité


Citizenship


Adresse Postale


Post Office Address






Nom complet du cinquième co-inventeur, le cas échéant


Full name of fifth joint inventor, if any


Signature de l'inventeur Date


Fifth inventor's signature Date


Domicile 


Residence 


Nationalité


Citizenship 


Adresse Postale 


Post Office Address 






Nom complet du sixième co-inventeur, le cas échéant


Full name of sixth joint inventor, if any


Signature de l'inventeur Date


Sixth inventor's signature Date 


Domicile 


Residence 


Nationalité


Citizenship


Adresse Postale 


Post Office Address 







(Fournir les mêmes renseignements et la signature de tout
co-inventeur supplémentaire.)

(Supply similar information and signature for third and
subsequent joint inventors.)
SEQUENCE LISTING



<110> Elbaz, Nathalie 
Nahmias, Clara 
Strosberg, Arthur Donny 

<120> NUCLEIC SEQUENCES ENCODING AN AT2
RECEPTOR-INTERACTING PROTEIN (ATIP) AND THEIR APPLICATIONS



<130> 33339/208804 

<140> US 09/762,194
<141>

<150> PCT/FR99/01908
<151> 1999-08-02

<150> FR 98/09997 
<151> 1998-08-04 

<160> 12 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1803 

<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 

<222> (178)...(1500)
<400> 1 



gctacccccc ccccacgcac cccccaatct gggtggcctg gcattagcat gtaagcttgt 60
ttttctctgg ctgtatctct tggcctggaa gaaccccgag ttgccaagag acacagtatg 120
tgatggtccc tggaaaagct gcttcccctg cgaagttctc ccactggctt cgaagac atg 180
Met
1

ctg ttg tct ccc aaa ttc tcc tta tcc acc atc cac gtc cgc cta acc 228
Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu Thr
5 10 15

gcc aaa gga ctg ctt cga aac ctc cgg ctt cct tcg ggg ctc agg aaa 276
Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg Lys
20 25 30

aac act gtc att ttc cac aca gtt gaa aag ggc agg cag aag aat ccc 324
Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn Pro
35 40 45

agg agc ctg tgc atc cag acc cag aca gct cca gat gtg ctg tcc tcc 372
Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser Ser
50 55 60 65

gag aga acg ctt gag ttg gcc caa tac aag aca aaa tgt gaa agc caa 420
Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser Gln
70 75 80

agt gga ttc atc ctg cac ctc agg cag ctt ctt tcc cgt ggt aac aac 468
Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn Asn
85 90 95

aag ttt gaa gcg ctg aca gtt gtg atc cag cac ctc ctg tct gag cgg 516
Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu Arg
100 105 110

gag gaa gca ctg aag caa cac aaa acc ctc tct caa gaa ctt gtc agc 564
Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val Ser
115 120 125

ctc cgg gga gag cta gtt gct gct tca agc gcc tgt gag aag cta gaa 612
Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu Glu
130 135 140 145

aag gct agg gct gac tta cag aca gcg tat caa gaa ttt gtc cag aaa 660
Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln Lys
150 155 160

cta aac cag cag cat cag aca gac cgg acg gaa ctg gag aac cgg ctg 708
Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg Leu
165 170 175

aag gac tta tac acc gca gag tgt gag aag ctt cag agc att tac att 756
Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr Ile
180 185 190

gag gag gca gaa aaa tat aaa act caa ctg caa gag cag ttt gac aac 804
Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp Asn
195 200 205

tta aac gcc gcc cat gag acc act aag ctt gag att gaa gct agc cac 852
Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser His
210 215 220 225



tcg gag aag gtg gaa ttg ctg aag aag acc tat gaa acc tcc ctt tca 900
Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu Ser
230 235 240

gaa atc aag aag agc cat gag atg gag aag aag tca ctg gag gat ctg 948
Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp Leu
245 250 255

ctt aat gag aag cag gaa tcg ctg gag aaa caa atc aat gat ctg aag 996
Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys
260 265 270

agt gaa aac gat gct tta aac gaa agg ttg aaa tca gag gag caa aag 1044
Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln Lys
275 280 285

caa ctg tca aga gag aag gcg aat tcc aaa aac cct cag gtc atg tat 1092
Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met Tyr
290 295 300 305

ctg gag caa gaa cta gaa agc ctg aag gct gtg tta gag atc aag aat 1140
Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn
310 315 320

gag aag ctg cac cag cag gac atg aag cta atg aag atg gaa aag ctg 1188
Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys Leu
325 330 335

gtg gac aat aac aca gca ttg gtt gac aag ctg aag cga ttc cag cag 1236
Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln
340 345 350

gaa aac gag gag tta aaa gct cgc atg gac aaa cac atg gca att tca 1284
Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile Ser
355 360 365

agg caa ctt tcc acc gag cag gcc gcg ctg caa gag tcc ctt gag aag 1332
Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu Lys
370 375 380 385

gag tca aag gtc aac aag aga ctg tcc atg gag aac gag gaa ctt ctg 1380
Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu
390 395 400

tgg aaa ctg cac aac gga gac ctg tgc agc ccc aag aga tcc ccc acc 1428
Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr
405 410 415

tcc tcg gcc atc cct ttc cag tcc ccc agg aat tct ggt tcc ttc tcc 1476
Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe Ser
420 425 430

agc ccc agc atc tca ccc aga tga cggcttctga acgcaggaga ctctctgaag 1530
Ser Pro Ser Ile Ser Pro Arg *
435 440

gcactgaggt gcgcttctgc aggactgacc ctctcatggg aactcgagtt gctgcgttag 1590

ctctctggaa tatccccagg atatcgggag agcagccgcc aaccgtatca gctacgtacg 1650

aatagagagc tccaatagaa gacttttaac ttggtccaaa agcctcctcc aaaaacagat 1710

ttcggaactg aagtggacat agttgcacaa agcacttacg gaacgaggga accttgttct 1770
ttgccttcct tcacctaagc ataggctttc cag 1803

<210> 2 
<211> 440 
<212> PRT 

<213> Mus musculus 
<400> 2 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

Ser Ser Pro Ser Ile Ser Pro Arg
435 440
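
As a consistency check on the listing, the coding range given in the <222> field of SEQ ID NO: 1 can be compared with the length of the protein in SEQ ID NO: 2. The sketch below is only an illustration: the function name is hypothetical, and it assumes that the <222> range is 1-based, inclusive, and includes the stop codon (as it does here, where the last codon of the range is tga).

```python
def protein_length_from_cds(start: int, end: int) -> int:
    """Return the number of residues encoded by a CDS.

    start and end are 1-based inclusive nucleotide positions, as in a
    <222> field; the range is assumed to include the stop codon.
    """
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return span // 3 - 1  # drop the stop codon


# SEQ ID NO: 1, CDS (178)...(1500): 1323 nt, 441 codons, 440 residues
print(protein_length_from_cds(178, 1500))
```

The result, 440, matches the <211> value of SEQ ID NO: 2.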



<210> 3 

<211> 1323 

<212> DNA 

<213> Mus musculus 



<220> 
<221> CDS 

<222> (1)...(1323)

<400> 3

atg ctg ttg tct ccc aaa ttc tcc tta tcc acc atc cac gtc cgc cta 48
Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

acc gcc aaa gga ctg ctt cga aac ctc cgg ctt cct tcg ggg ctc agg 96
Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

aaa aac act gtc att ttc cac aca gtt gaa aag ggc agg cag aag aat 144
Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

ccc agg agc ctg tgc atc cag acc cag aca gct cca gat gtg ctg tcc 192
Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

tcc gag aga acg ctt gag ttg gcc caa tac aag aca aaa tgt gaa agc 240
Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

caa agt gga ttc atc ctg cac ctc agg cag ctt ctt tcc cgt ggt aac 288
Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

aac aag ttt gaa gcg ctg aca gtt gtg atc cag cac ctc ctg tct gag 336
Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

cgg gag gaa gca ctg aag caa cac aaa acc ctc tct caa gaa ctt gtc 384
Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125



agc ctc cgg gga gag cta gtt gct gct tca agc gcc tgt gag aag cta 432
Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

gaa aag gct agg gct gac tta cag aca gcg tat caa gaa ttt gtc cag 480
Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

aaa cta aac cag cag cat cag aca gac cgg acg gaa ctg gag aac cgg 528
Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

ctg aag gac tta tac acc gca gag tgt gag aag ctt cag agc att tac 576
Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

att gag gag gca gaa aaa tat aaa act caa ctg caa gag cag ttt gac 624
Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

aac tta aac gcc gcc cat gag acc act aag ctt gag att gaa gct agc 672
Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

cac tcg gag aag gtg gaa ttg ctg aag aag acc tat gaa acc tcc ctt 720
His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

tca gaa atc aag aag agc cat gag atg gag aag aag tca ctg gag gat 768
Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

ctg ctt aat gag aag cag gaa tcg ctg gag aaa caa atc aat gat ctg 816
Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

aag agt gaa aac gat gct tta aac gaa agg ttg aaa tca gag gag caa 864
Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

aag caa ctg tca aga gag aag gcg aat tcc aaa aac cct cag gtc atg 912
Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

tat ctg gag caa gaa cta gaa agc ctg aag gct gtg tta gag atc aag 960
Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

aat gag aag ctg cac cag cag gac atg aag cta atg aag atg gaa aag 1008
Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

ctg gtg gac aat aac aca gca ttg gtt gac aag ctg aag cga ttc cag 1056
Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

cag gaa aac gag gag tta aaa gct cgc atg gac aaa cac atg gca att 1104
Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

tca agg caa ctt tcc acc gag cag gcc gcg ctg caa gag tcc ctt gag 1152
Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

aag gag tca aag gtc aac aag aga ctg tcc atg gag aac gag gaa ctt 1200
Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

ctg tgg aaa ctg cac aac gga gac ctg tgc agc ccc aag aga tcc ccc 1248
Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

acc tcc tcg gcc atc cct ttc cag tcc ccc agg aat tct ggt tcc ttc 1296
Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

tcc agc ccc agc atc tca ccc aga tga 1323
Ser Ser Pro Ser Ile Ser Pro Arg *
435 440



<210> 4 
<211> 440 
<212> PRT 

<213> Mus musculus 

<400> 4

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

Ser Ser Pro Ser Ile Ser Pro Arg
435 440



<210> 5 

<211> 354 

<212> DNA 

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1)...(354)

<223> Insert identified by two-hybrid screening of a M.
musculus foetal cDNA library

<400> 5

cat cag aca gac cgg acg gaa ctg gag aac cgg ctg aag gac tta tac 48
His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg Leu Lys Asp Leu Tyr
1 5 10 15

acc gca gag tgt gag aag ctt cag agc att tac att gag gag gca gaa 96
Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr Ile Glu Glu Ala Glu
20 25 30

aaa tat aaa act caa ctg caa gag cag ttt gac aac tta aac gcc gcc 144
Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala Ala
35 40 45

cat gag acc act aag ctt gag att gaa gct agc cac tcg gag aag gtg 192
His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Val
50 55 60

gaa ttg ctg aag aag acc tat gaa acc tcc ctt tca gaa atc aag aag 240
Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu Ser Glu Ile Lys Lys
65 70 75 80

agc cat gag atg gag aag aag tca ctg gag gat ctg ctt aat gag aag 288
Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp Leu Leu Asn Glu Lys
85 90 95

cag gaa tcg ctg gag aaa caa atc aat gat ctg aag agt gaa aac gat 336
Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
100 105 110

gct tta aac gaa agg ttg 354
Ala Leu Asn Glu Arg Leu
115



<210> 6 

<211> 118 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Insert identified by yeast two hybrid screening of 
a M. musculus fetal cDNA library 

<400> 6 

His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg Leu Lys Asp Leu Tyr
1 5 10 15

Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr Ile Glu Glu Ala Glu
20 25 30

Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala Ala
35 40 45

His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Val
50 55 60

Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu Ser Glu Ile Lys Lys
65 70 75 80

Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp Leu Leu Asn Glu Lys
85 90 95

Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
100 105 110

Ala Leu Asn Glu Arg Leu
115



<210> 7 

<211> 3742 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 

<222> (293)...(1600)



<400> 7 

cagtgtgatg tggttcagag gcagcttcta gacctgcagg agggagattg tattcagagg 60
aagagcatca ttttggcaac atctgaaagt gaaaacggaa gccagaaaca cttggccagc 120
cctgggggat ttttttcttc tatgcctctg tggtggaatg acatttgctg tgtaggcatc 180
tttcctctga ctgtatttct tggccttgaa gagtactgag tttaaaaaga cagtatgtga 240
cagtccatgg aaattgcctc ttctgtgaaa tctcgccacc tgctccgaag ac atg ttg 298
Met Leu
1

ttg tct ccc aaa ttc tcc tta tcc acc att cac ata cga ctg acg gcc 346
Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu Thr Ala
5 10 15

aaa gga ttg ctt cga aac ctt cga ctt cct tca ggg ttt agg aga agc 394
Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg Arg Ser
20 25 30

act gtt gtt ttc cac aca gtt gaa aag agc agg caa aag aat cct cga 442
Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn Pro Arg
35 40 45 50

agc tta tgt atc cag cca cag aca gct ccc gat gcg ctg ccc cct gag 490
Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro Pro Glu
55 60 65

aaa aca ctt gaa ttg acg caa tat aaa aca aaa tgt gaa aac caa agt 538
Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn Gln Ser
70 75 80

gga ttt atc ctg cag ctc aag cag ctt ctt gcc tgt ggt aat acc aag 586
Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn Thr Lys
85 90 95

ttt gag gca ttg aca gtt gtg att cag cac ctg ctg tct gag cgg gag 634
Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu Arg Glu
100 105 110

gaa gca ctg aaa caa cac aaa acc cta tct caa gaa ctt gtt aac ctc 682
Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val Asn Leu
115 120 125 130

cgg gga gag cta gtc act gct tca acc acc tgt gag aaa tta gaa aaa 730
Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu Glu Lys
135 140 145

gcc agg aat gag tta caa aca gtg tat gaa gca ttc gtc cag cag cac 778
Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln Gln His
150 155 160

cag gct gaa aaa aca gaa cga gag aat cgg ctt aaa gag ttt tac acc 826
Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe Tyr Thr
165 170 175

agg gag tat gaa aag ctt cgg gac act tac att gaa gaa gca gag aag 874
Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala Glu Lys
180 185 190

tac aaa atg caa ttg caa gag cag ttt gac aac tta aat gcg cat gaa 922
Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala His Glu
195 200 205 210

acc tct aag ttg gaa att gaa gct agc cac tca gag aaa ctt gaa ttg 970
Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu Glu Leu
215 220 225

cta aag aag gcc tat gaa gcc tcc ctt tca gaa att aag aaa ggc cat 1018
Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys Gly His
230 235 240

gaa ata gaa aag aaa tcg ctt gaa gat tta ctt tct gag aag cag gaa 1066
Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys Gln Glu
245 250 255

tcg cta gag aag caa atc aat gat ctg aag agt gaa aat gat gct tta 1114
Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp Ala Leu
260 265 270

aat gaa aaa ttg aaa tca gaa gaa caa aaa aga aga gca aga gaa aaa 1162
Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg Glu Lys
275 280 285 290

gca aat ttg aaa aat cct cag atc atg tat cta gaa cag gag tta gaa 1210
Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu Leu Glu
295 300 305

agc ctg aaa gct gtg tta gag atc aag aat gag aaa ctg cat caa cag 1258
Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His Gln Gln
310 315 320

gac atc aag tta atg aaa atg gag aaa ctg gtg gac aac aac aca gca 1306
Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn Thr Ala
325 330 335

ttg gtt gac aaa ttg aag cgt ttc cag cag gag aat gaa gaa ttg aaa 1354
Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu Leu Lys
340 345 350

gct cgg atg gac aag cac atg gca atc tca agg cag ctt tcc acg gag 1402
Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser Thr Glu
355 360 365 370

cag gct gtt ctg caa gag tcg ctg gag aag gag tcg aaa gtc aac aag 1450
Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val Asn Lys
375 380 385

cga ctc tct atg gaa aac gag gag ctt ctg tgg aaa ctg cac aat ggg 1498
Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His Asn Gly
390 395 400

gac ctg tgt agc ccc aag aga tcc ccc aca tcc tcc gcc atc cct ttg 1546
Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile Pro Leu
405 410 415

cag tca cca agg aat tcg ggc tcc ttc cct agc ccc agc att tca ccc 1594
Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile Ser Pro
420 425 430

aga tga cacgtcccca aagtccacag actctctgaa agcattttga tgcaggtctg 1650
Arg *
435

caggactgac cccaaggagg aacgtgggca caagaggtat atcagcacac gtgtgatcac 1710 

cgtaggtaac tggagcgtca ccaccggcgg aatcgagctt ctgagactgg aagtctggag 1770

gaagactttt gcctccgtcc aaaagattcc tccaaaaaaa gatttaaaaa aagatttcgg 1830

catcgacacg gacgttgttg cacaaagcac ttaaagaacg agagcatctt gttcattgcc 1890

tttttcacct aagcataagg ggaaaaactc tcagggccct attaagattt ataacctttg 1950 

taatgttctt caccacagac accttcttgt gagttttcag tctgactgtg ggggtggggg 2010 

gtgtgaatga aatggatgtc acagagtgtc atgtgtctga tgcagcctcc tctgctgtgt 2070 

attaaatgtc aaaatctgaa tatatctgga tatgtactaa tcaaataata atcaatcaat 2130 

cagcatatac atttcagcca aagccataga agaaaaagca atagttgctt gaattatgat 2190

catctaccac caactctgct cagccctgta acagggtagg gagagggtat aacaggaaga 2250 

gctttgactt gtccctgtct atacattctc tgtatctttt gggggtaact tcttggcagt 2310 

ttttcagtgt tcagccatgt cagttgaaac tagatttttc tgtagatttt ttacttaccc 2370

atgtgagcct aacactatcc tgtaattcat tttctcaggc tatgtgtaaa tgtagaaccc 2430 

taatttttct ataaaaaaac aaactaacta actgtgtaaa gaaagaaaaa gggaagtacc 2490 

aatgggtttt tccaccttat ttttaccttt gatctaccct tgcagattta acctgtcttc 2550 

ttccctccca ttattctcat tttcctttta cctttctcca ccatccagag ccacaaaagc 2610 

aaaccttcta cctcctacct acttttctct gggacaagga taaaggaata tgattttcca 2670 

gagccccaga gccagctcat cttccaggtg ctgaaaccac tttccaaata aactaaagcc 2730

tggatttgat attacaaatt ttgggaaatc ttagaataaa gaacgagaac aaggaagtca 2790 

ttggctagta taattaagaa aggtaggatt cagtgcttac cgatgatgca gtacttgata 2850

gaagaaaaca gtctgggagg atagcgctca tttttcagtt accctttaag gagtcccttt 2910

gtctttggga aagtagcaga atggtccgct tctttcccat gagtggaaaa tgtggcttgt 2970

ccaactctcc tccaggttgc atttcagttt ctttccaaaa cttattacct cccctaatcc 3030 

tgagactttg gaaaaggtgg aaggaagaac tgttgcttta tctccccctc cctgcatgtg 3090 

tcaacattgt gatgtcagta tttactaatc tacattcagt ggctgtacaa ataacagctg 3150 

tagtaagaag agattcagga tgctagaggt gaatatttgg gtcatttaca tgtacactac 3210 

atagcaagtt gatactcatg ttgcatgttc ttttaaatta gtgattttgt gtcttaagtc 3270

tttaacttcc aatacttcat catgtatgta accttccatg tttgcttctg ataaatggaa 3330

atgtaggttc actgccactt catgagatat ctctgctcac gcttccaagt tgttctcaat 3390

gacattagcc aaagttgggt ttgccattca tcccctaggc atggtaaatc ttgtgttgtt 3450 

ccctgctgtc ctccgtatta cgtgaccggc aaataaatct catagcagtt aatataaaac 3510 

atctttggag gatgggagag aacaggaggg aagatgggaa acaaaataga gaattcttaa 3570 

gattttgttt aaaccaaatg tttcatgtag aatgcaaaat gttggcacgt caaaaatatg 3630 

aatgtgtaga caactgtagt tgtgctcagt ttgtagtgat gggaagtgta ttttactctg 3690

atcaaataaa taatgctgga atactcaaaa aaaaaaaaaa aaaaaaaaaa aa 3742 

<210> 8 
<211> 435 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg
20 25 30

Arg Ser Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro
50 55 60

Pro Glu Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn
65 70 75 80

Gln Ser Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn
85 90 95

Thr Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Asn Leu Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln
145 150 155 160

Gln His Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe
165 170 175

Tyr Thr Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala
180 185 190

Glu Lys Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala
195 200 205

His Glu Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu
210 215 220

Glu Leu Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys
225 230 235 240

Gly His Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys
245 250 255

Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
260 265 270

Ala Leu Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg
275 280 285

Glu Lys Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu
290 295 300

Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His
305 310 315 320

Gln Gln Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn
325 330 335

Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu
340 345 350

Leu Lys Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser
355 360 365

Thr Glu Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val
370 375 380

Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His
385 390 395 400

Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile
405 410 415

Pro Leu Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile
420 425 430

Ser Pro Arg
435

<210> 9 
<211> 1308 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> (1)...(1308)

<400> 9 

atg ttg ttg tct ccc aaa ttc tcc tta tcc acc att cac ata cga ctg 48
Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu
1 5 10 15

acg gcc aaa gga ttg ctt cga aac ctt cga ctt cct tca ggg ttt agg 96
Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg
20 25 30

aga agc act gtt gtt ttc cac aca gtt gaa aag agc agg caa aag aat 144
Arg Ser Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn
35 40 45

cct cga agc tta tgt atc cag cca cag aca gct ccc gat gcg ctg ccc 192
Pro Arg Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro
50 55 60

cct gag aaa aca ctt gaa ttg acg caa tat aaa aca aaa tgt gaa aac 240
Pro Glu Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn
65 70 75 80

caa agt gga ttt atc ctg cag ctc aag cag ctt ctt gcc tgt ggt aat 288
Gln Ser Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn
85 90 95

acc aag ttt gag gca ttg aca gtt gtg att cag cac ctg ctg tct gag 336
Thr Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

cgg gag gaa gca ctg aaa caa cac aaa acc cta tct caa gaa ctt gtt 384
Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

aac ctc cgg gga gag cta gtc act gct tca acc acc tgt gag aaa tta 432
Asn Leu Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu
130 135 140

gaa aaa gcc agg aat gag tta caa aca gtg tat gaa gca ttc gtc cag 480
Glu Lys Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln
145 150 155 160

cag cac cag gct gaa aaa aca gaa cga gag aat cgg ctt aaa gag ttt 528
Gln His Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe
165 170 175

tac acc agg gag tat gaa aag ctt cgg gac act tac att gaa gaa gca 576
Tyr Thr Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala
180 185 190

gag aag tac aaa atg caa ttg caa gag cag ttt gac aac tta aat gcg 624
Glu Lys Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala
195 200 205

cat gaa acc tct aag ttg gaa att gaa gct agc cac tca gag aaa ctt 672
His Glu Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu
210 215 220

gaa ttg cta aag aag gcc tat gaa gcc tcc ctt tca gaa att aag aaa 720
Glu Leu Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys
225 230 235 240

ggc cat gaa ata gaa aag aaa tcg ctt gaa gat tta ctt tct gag aag 768
Gly His Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys
245 250 255

cag gaa tcg cta gag aag caa atc aat gat ctg aag agt gaa aat gat 816
Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
260 265 270

gct tta aat gaa aaa ttg aaa tca gaa gaa caa aaa aga aga gca aga 864
Ala Leu Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg
275 280 285

gaa aaa gca aat ttg aaa aat cct cag atc atg tat cta gaa cag gag 912
Glu Lys Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu
290 295 300

tta gaa agc ctg aaa gct gtg tta gag atc aag aat gag aaa ctg cat 960
Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His
305 310 315 320

caa cag gac atc aag tta atg aaa atg gag aaa ctg gtg gac aac aac 1008
Gln Gln Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn
325 330 335

aca gca ttg gtt gac aaa ttg aag cgt ttc cag cag gag aat gaa gaa 1056
Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu
340 345 350

ttg aaa gct cgg atg gac aag cac atg gca atc tca agg cag ctt tcc 1104
Leu Lys Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser
355 360 365

acg gag cag gct gtt ctg caa gag tcg ctg gag aag gag tcg aaa gtc 1152
Thr Glu Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val
370 375 380

aac aag cga ctc tct atg gaa aac gag gag ctt ctg tgg aaa ctg cac 1200
Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His
385 390 395 400

aat ggg gac ctg tgt agc ccc aag aga tcc ccc aca tcc tcc gcc atc 1248
Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile
405 410 415

cct ttg cag tca cca agg aat tcg ggc tcc ttc cct agc ccc agc att 1296
Pro Leu Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile
420 425 430

tca ccc aga tga 1308
Ser Pro Arg
435



<210> 10 
<211> 435 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg
20 25 30

Arg Ser Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro
50 55 60

Pro Glu Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn
65 70 75 80

Gln Ser Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn
85 90 95

Thr Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Asn Leu Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln
145 150 155 160

Gln His Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe
165 170 175

Tyr Thr Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala
180 185 190

Glu Lys Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala
195 200 205

His Glu Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu
210 215 220

Glu Leu Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys
225 230 235 240

Gly His Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys
245 250 255

Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
260 265 270

Ala Leu Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg
275 280 285

Glu Lys Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu
290 295 300

Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His
305 310 315 320

Gln Gln Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn
325 330 335

Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu
340 345 350

Leu Lys Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser
355 360 365

Thr Glu Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val
370 375 380

Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His
385 390 395 400

Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile
405 410 415

Pro Leu Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile
420 425 430

Ser Pro Arg
435

<210> 11
<211> 33
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 
<400> 11 

cgcggatccc agacagaccg gacggaactg gag 33 

<210> 12 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide primer 

<400> 12 

ccggaattca ctacaacctt tcgtttaaag catc 34 
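
The two primers above can be checked against SEQ ID NO: 5 of this listing with a few lines of code. The following is only a minimal sketch under stated assumptions: the listing describes SEQ ID NO: 11 and 12 simply as "Oligonucleotide primer", so reading them as a forward/reverse pair for the SEQ ID NO: 5 fragment is an inference from the sequences themselves. The hexamers GGATCC and GAATTC are the standard recognition sequences of BamHI and EcoRI; the helper names `reverse_complement` and `split_primer` are hypothetical.

```python
# Sequences copied from SEQ ID NO: 11, 12 and 5 of this listing.
# Helper names and the overall check are illustrative assumptions.
SEQ11 = "cgcggatcccagacagaccggacggaactggag"    # SEQ ID NO: 11, 33 nt
SEQ12 = "ccggaattcactacaacctttcgtttaaagcatc"   # SEQ ID NO: 12, 34 nt
SEQ5_5PRIME = "catcagacagaccggacggaactggag"    # first 27 nt of SEQ ID NO: 5
SEQ5_3PRIME = "gatgctttaaacgaaaggttg"          # last 21 nt of SEQ ID NO: 5

# Standard recognition hexamers (not stated in the listing itself).
SITES = {"BamHI": "ggatcc", "EcoRI": "gaattc"}


def reverse_complement(seq):
    """Return the reverse complement of a lowercase DNA string."""
    pairs = {"a": "t", "c": "g", "g": "c", "t": "a"}
    return "".join(pairs[base] for base in reversed(seq))


def split_primer(primer, site):
    """Split a primer into (5' tail ending with the site, annealing part)."""
    end = primer.index(site) + len(site)
    return primer[:end], primer[end:]


fwd_tail, fwd_anneal = split_primer(SEQ11, SITES["BamHI"])
rev_tail, rev_anneal = split_primer(SEQ12, SITES["EcoRI"])

# Forward primer: its annealing part should occur in the sense strand
# near the 5' end of SEQ ID NO: 5.
forward_ok = fwd_anneal in SEQ5_5PRIME

# Reverse primer: the reverse complement of its annealing part should
# begin with the 3' end of SEQ ID NO: 5.
reverse_ok = reverse_complement(rev_anneal).startswith(SEQ5_3PRIME)

print(len(SEQ11), len(SEQ12), forward_ok, reverse_ok)  # 33 34 True True
```

Under these assumptions, the reverse complement of the annealing part of SEQ ID NO: 12 reads `gatgctttaaacgaaaggttgtagt`, that is, the last 21 nucleotides of SEQ ID NO: 5 followed by `tag`, which would act as a stop codon in the amplified fragment.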



Rec'd PCT/PTO 19 APR 2001

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE-CNRS 

(B) STREET: 3 rue Michel-Ange

(C) CITY: Paris

(E) COUNTRY: FRANCE 

(F) POSTAL CODE: 75794 PARIS Cedex 16 

(A) NAME: ELBAZ Nathalie 

(B) STREET: 7 Passage des Italiens 

(C) CITY: Bagnolet 

(E) COUNTRY: FRANCE

(F) POSTAL CODE: 93170

(A) NAME: NAHMIAS Clara

(B) STREET: 4 rue Bailly 

(C) CITY: Paris 

(E) COUNTRY: FRANCE

(F) POSTAL CODE: 75003 

(A) NAME: STROSBERG Arthur Donny 

(B) STREET: 66 rue de Javel 

(C) CITY: Paris 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE: 75015 

(ii) TITLE OF THE INVENTION: NUCLEIC SEQUENCES ENCODING AN AT2
RECEPTOR-INTERACTING PROTEIN (ATIP) AND THEIR APPLICATIONS

(iii) NUMBER OF SEQUENCES: 12

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1803 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 178..1500



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GCTACCCCCC CCCCACGCAC CCCCCAATCT GGGTGGCCTG GCATTAGCAT GTAAGCTTGT 60 

TTTTCTCTGG CTGTATCTCT TGGCCTGGAA GAACCCCGAG TTGCCAAGAG ACACAGTATG 120

TGATGGTCCC TGGAAAAGCT GCTTCCCCTG CGAAGTTCTC CCACTGGCTT CGAAGAC 177

ATG CTG TTG TCT CCC AAA TTC TCC TTA TCC ACC ATC CAC GTC CGC CTA 225
Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

ACC GCC AAA GGA CTG CTT CGA AAC CTC CGG CTT CCT TCG GGG CTC AGG 273
Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

AAA AAC ACT GTC ATT TTC CAC ACA GTT GAA AAG GGC AGG CAG AAG AAT 321
Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

CCC AGG AGC CTG TGC ATC CAG ACC CAG ACA GCT CCA GAT GTG CTG TCC 369
Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

TCC GAG AGA ACG CTT GAG TTG GCC CAA TAC AAG ACA AAA TGT GAA AGC 417
Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

CAA AGT GGA TTC ATC CTG CAC CTC AGG CAG CTT CTT TCC CGT GGT AAC 465
Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

AAC AAG TTT GAA GCG CTG ACA GTT GTG ATC CAG CAC CTC CTG TCT GAG 513
Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

CGG GAG GAA GCA CTG AAG CAA CAC AAA ACC CTC TCT CAA GAA CTT GTC 561
Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

AGC CTC CGG GGA GAG CTA GTT GCT GCT TCA AGC GCC TGT GAG AAG CTA 609
Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

GAA AAG GCT AGG GCT GAC TTA CAG ACA GCG TAT CAA GAA TTT GTC CAG 657
Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

AAA CTA AAC CAG CAG CAT CAG ACA GAC CGG ACG GAA CTG GAG AAC CGG 705
Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

CTG AAG GAC TTA TAC ACC GCA GAG TGT GAG AAG CTT CAG AGC ATT TAC 753
Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

ATT GAG GAG GCA GAA AAA TAT AAA ACT CAA CTG CAA GAG CAG TTT GAC 801
Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

AAC TTA AAC GCC GCC CAT GAG ACC ACT AAG CTT GAG ATT GAA GCT AGC 849
Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

CAC TCG GAG AAG GTG GAA TTG CTG AAG AAG ACC TAT GAA ACC TCC CTT 897
His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

TCA GAA ATC AAG AAG AGC CAT GAG ATG GAG AAG AAG TCA CTG GAG GAT 945
Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

CTG CTT AAT GAG AAG CAG GAA TCG CTG GAG AAA CAA ATC AAT GAT CTG 993
Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

AAG AGT GAA AAC GAT GCT TTA AAC GAA AGG TTG AAA TCA GAG GAG CAA 1041
Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

AAG CAA CTG TCA AGA GAG AAG GCG AAT TCC AAA AAC CCT CAG GTC ATG 1089
Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

TAT CTG GAG CAA GAA CTA GAA AGC CTG AAG GCT GTG TTA GAG ATC AAG 1137
Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

AAT GAG AAG CTG CAC CAG CAG GAC ATG AAG CTA ATG AAG ATG GAA AAG 1185
Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

CTG GTG GAC AAT AAC ACA GCA TTG GTT GAC AAG CTG AAG CGA TTC CAG 1233
Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

CAG GAA AAC GAG GAG TTA AAA GCT CGC ATG GAC AAA CAC ATG GCA ATT 1281
Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

TCA AGG CAA CTT TCC ACC GAG CAG GCC GCG CTG CAA GAG TCC CTT GAG 1329
Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

AAG GAG TCA AAG GTC AAC AAG AGA CTG TCC ATG GAG AAC GAG GAA CTT 1377
Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

CTG TGG AAA CTG CAC AAC GGA GAC CTG TGC AGC CCC AAG AGA TCC CCC 1425
Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

ACC TCC TCG GCC ATC CCT TTC CAG TCC CCC AGG AAT TCT GGT TCC TTC 1473
Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

TCC AGC CCC AGC ATC TCA CCC AGA TGA CGGCTTCTGA ACGCAGGAGA 1520
Ser Ser Pro Ser Ile Ser Pro Arg *
435 440

CTCTCTGAAG GCACTGAGGT GCGCTTCTGC AGGACTGACC CTCTCATGGG AACTCGAGTT 1580

GCTGCGTTAG CTCTCTGGAA TATCCCCAGG ATATCGGGAG AGCAGCCGCC AACCGTATCA 1640

GCTACGTACG AATAGAGAGC TCCAATAGAA GACTTTTAAC TTGGTCCAAA AGCCTCCTCC 1700 

AAAAACAGAT TTCGGAACTG AAGTGGACAT AGTTGCACAA AGCACTTACG GAACGAGGGA 1760 

ACCTTGTTCT TTGCCTTCCT TCACCTAAGC ATAGGCTTTC CAG 1803 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 440 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

Ser Ser Pro Ser Ile Ser Pro Arg *
435 440



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1323 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1322



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CTG TTG TCT CCC AAA TTC TCC TTA TCC ACC ATC CAC GTC CGC CTA 48
Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
445 450 455

ACC GCC AAA GGA CTG CTT CGA AAC CTC CGG CTT CCT TCG GGG CTC AGG 96
Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
460 465 470

AAA AAC ACT GTC ATT TTC CAC ACA GTT GAA AAG GGC AGG CAG AAG AAT 144
Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn

CCC AGG AGC CTG TGC ATC CAG ACC CAG ACA GCT CCA GAT GTG CTG TCC 192
Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser

TCC GAG AGA ACG CTT GAG TTG GCC CAA TAC AAG ACA AAA TGT GAA AGC 240
Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser

CAA AGT GGA TTC ATC CTG CAC CTC AGG CAG CTT CTT TCC CGT GGT AAC 288
Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn

AAC AAG TTT GAA GCG CTG ACA GTT GTG ATC CAG CAC CTG CTG TCT GAG 336
Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu

CGG GAG GAA GCA CTG AAG CAA CAC AAA ACC CTC TCT CAA GAA CTT GTC 384
Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val

AGC CTC CGG GGA GAG CTA GTT GCT GCT TCA AGC GCC TGT GAG AAG CTA 432
Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu

GAA AAG GCT AGG GCT GAC TTA CAG ACA GCG TAT CAA GAA TTT GTC CAG 480
Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln

AAA CTA AAC CAG CAG CAT CAG ACA GAC CGG ACG GAA CTG GAG AAC CGG 528
Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg

CTG AAG GAC TTA TAC ACC GCA GAG TGT GAG AAG CTT CAG AGC ATT TAC 576
Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr

ATT GAG GAG GCA GAA AAA TAT AAA ACT CAA CTG CAA GAG CAG TTT GAC 624
Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp

AAC TTA AAC GCC GCC CAT GAG ACC ACT AAG CTT GAG ATT GAA GCT AGC 672
Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser

CAC TCG GAG AAG GTG GAA TTG CTG AAG AAG ACC TAT GAA ACC TCC CTT 720
His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu

TCA GAA ATC AAG AAG AGC CAT GAG ATG GAG AAG AAG TCA CTG GAG GAT 768
Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp

CTG CTT AAT GAG AAG CAG GAA TCG CTG GAG AAA CAA ATC AAT GAT CTG 816
Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu

AAG AGT GAA AAC GAT GCT TTA AAC GAA AGG TTG AAA TCA GAG GAG CAA 864
Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln

AAG CAA CTG TCA AGA GAG AAG GCG AAT TCC AAA AAC CCT CAG GTC ATG 912
Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met

TAT CTG GAG CAA GAA CTA GAA AGC CTG AAG GCT GTG TTA GAG ATC AAG 960
Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys

AAT GAG AAG CTG CAC CAG CAG GAC ATG AAG CTA ATG AAG ATG GAA AAG 1008
Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys

CTG GTG GAC AAT AAC ACA GCA TTG GTT GAC AAG CTG AAG CGA TTC CAG 1056
Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln

CAG GAA AAC GAG GAG TTA AAA GCT CGC ATG GAC AAA CAC ATG GCA ATT 1104
Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile

TCA AGG CAA CTT TCC ACC GAG CAG GCC GCG CTG CAA GAG TCC CTT GAG 1152
Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu

AAG GAG TCA AAG GTC AAC AAG AGA CTG TCC ATG GAG AAC GAG GAA CTT 1200
Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu

CTG TGG AAA CTG CAC AAC GGA GAC CTG TGC AGC CCC AAG AGA TCC CCC 1248
Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro

ACC TCC TCG GCC ATC CCT TTC CAG TCC CCC AGG AAT TCT GGT TCC TTC 1296
Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe

TCC AGC CCC AGC ATC TCA CCC AGA TGA 1323
Ser Ser Pro Ser Ile Ser Pro Arg



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Val Arg Leu
1 5 10 15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Leu Arg
20 25 30

Lys Asn Thr Val Ile Phe His Thr Val Glu Lys Gly Arg Gln Lys Asn
35 40 45

Pro Arg Ser Leu Cys Ile Gln Thr Gln Thr Ala Pro Asp Val Leu Ser
50 55 60

Ser Glu Arg Thr Leu Glu Leu Ala Gln Tyr Lys Thr Lys Cys Glu Ser
65 70 75 80

Gln Ser Gly Phe Ile Leu His Leu Arg Gln Leu Leu Ser Arg Gly Asn
85 90 95

Asn Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
100 105 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
115 120 125

Ser Leu Arg Gly Glu Leu Val Ala Ala Ser Ser Ala Cys Glu Lys Leu
130 135 140

Glu Lys Ala Arg Ala Asp Leu Gln Thr Ala Tyr Gln Glu Phe Val Gln
145 150 155 160

Lys Leu Asn Gln Gln His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg
165 170 175

Leu Lys Asp Leu Tyr Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr
180 185 190

Ile Glu Glu Ala Glu Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp
195 200 205

Asn Leu Asn Ala Ala His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser
210 215 220

His Ser Glu Lys Val Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu
225 230 235 240

Ser Glu Ile Lys Lys Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp
245 250 255

Leu Leu Asn Glu Lys Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu
260 265 270

Lys Ser Glu Asn Asp Ala Leu Asn Glu Arg Leu Lys Ser Glu Glu Gln
275 280 285

Lys Gln Leu Ser Arg Glu Lys Ala Asn Ser Lys Asn Pro Gln Val Met
290 295 300

Tyr Leu Glu Gln Glu Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys
305 310 315 320

Asn Glu Lys Leu His Gln Gln Asp Met Lys Leu Met Lys Met Glu Lys
325 330 335

Leu Val Asp Asn Asn Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln
340 345 350

Gln Glu Asn Glu Glu Leu Lys Ala Arg Met Asp Lys His Met Ala Ile
355 360 365

Ser Arg Gln Leu Ser Thr Glu Gln Ala Ala Leu Gln Glu Ser Leu Glu
370 375 380

Lys Glu Ser Lys Val Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu
385 390 395 400

Leu Trp Lys Leu His Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro
405 410 415

Thr Ser Ser Ala Ile Pro Phe Gln Ser Pro Arg Asn Ser Gly Ser Phe
420 425 430

Ser Ser Pro Ser Ile Ser Pro Arg
435 440

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs

(B) TYPE: nucleotide

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..354

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



CAT CAG ACA GAC CGG ACG GAA CTG GAG AAC CGG CTG AAG GAC TTA TAC 48
His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg Leu Lys Asp Leu Tyr
440 445 450

ACC GCA GAG TGT GAG AAG CTT CAG AGC ATT TAC ATT GAG GAG GCA GAA 96
Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr Ile Glu Glu Ala Glu
455 460 465

AAA TAT AAA ACT CAA CTG CAA GAG CAG TTT GAC AAC TTA AAC GCC GCC 144
Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala Ala
470 475 480

CAT GAG ACC ACT AAG CTT GAG ATT GAA GCT AGC CAC TCG GAG AAG GTG 192
His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Val
485 490 495 500

GAA TTG CTG AAG AAG ACC TAT GAA ACC TCC CTT TCA GAA ATC AAG AAG 240
Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu Ser Glu Ile Lys Lys
505 510 515

AGC CAT GAG ATG GAG AAG AAG TCA CTG GAG GAT CTG CTT AAT GAG AAG 288
Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp Leu Leu Asn Glu Lys
520 525 530

CAG GAA TCG CTG GAG AAA CAA ATC AAT GAT CTG AAG AGT GAA AAC GAT 336
Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
535 540 545

GCT TTA AAC GAA AGG TTG 354
Ala Leu Asn Glu Arg Leu
550



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

His Gln Thr Asp Arg Thr Glu Leu Glu Asn Arg Leu Lys Asp Leu Tyr
1 5 10 15

Thr Ala Glu Cys Glu Lys Leu Gln Ser Ile Tyr Ile Glu Glu Ala Glu
20 25 30

Lys Tyr Lys Thr Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala Ala
35 40 45

His Glu Thr Thr Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Val
50 55 60

Glu Leu Leu Lys Lys Thr Tyr Glu Thr Ser Leu Ser Glu Ile Lys Lys
65 70 75 80

Ser His Glu Met Glu Lys Lys Ser Leu Glu Asp Leu Leu Asn Glu Lys
85 90 95

Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
100 105 110

Ala Leu Asn Glu Arg Leu
115

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3742 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 293..1600

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CAGTGTGATG TGGTTCAGAG GCAGCTTCTA GACCTGCAGG AGGGAGATTG TATTCAGAGG 6 0 

AAGAGCATCA TTTTGGCAAC ATCTGAAAGT GAAAACGGAA GCCAGAAACA CTTGGCCAGC 12 0 

CCTGGGGGAT TTTTTTCTTC TATGCCTCTG TGGTGGAATG ACATTTGCTG TGTAGGCATC 18 0 

TTTCCTCTGA CTGTATTTCT TGGCCTTGAA GAGTACTGAG TTTAAAAAGA CAGTATGTGA 24 0 

CAGTCCATGG AAATTGCCTC TTCTGTGAAA TCTCGCCACC TGCTCCGAAG AC ATG 2 95 

Met

TTG TTG TCT CCC AAA TTC TCC TTA TCC ACC ATT CAC ATA CGA CTG ACG 343
Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu Thr

GCC AAA GGA TTG CTT CGA AAC CTT CGA CTT CCT TCA GGG TTT AGG AGA 391
Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg Arg

AGC ACT GTT GTT TTC CAC ACA GTT GAA AAG AGC AGG CAA AAG AAT CCT 439
Ser Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn Pro

CGA AGC TTA TGT ATC CAG CCA CAG ACA GCT CCC GAT GCG CTG CCC CCT 487
Arg Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro Pro

GAG AAA ACA CTT GAA TTG ACG CAA TAT AAA ACA AAA TGT GAA AAC CAA 535
Glu Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn Gln

AGT GGA TTT ATC CTG CAG CTC AAG CAG CTT CTT GCC TGT GGT AAT ACC 583
Ser Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn Thr

AAG TTT GAG GCA TTG ACA GTT GTG ATT CAG CAC CTG CTG TCT GAG CGG 631
Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu Arg

GAG GAA GCA CTG AAA CAA CAC AAA ACC CTA TCT CAA GAA CTT GTT AAC 679
Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val Asn

CTC CGG GGA GAG CTA GTC ACT GCT TCA ACC ACC TGT GAG AAA TTA GAA 727
Leu Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu Glu

AAA GCC AGG AAT GAG TTA CAA ACA GTG TAT GAA GCA TTC GTC CAG CAG 775
Lys Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln Gln

CAC CAG GCT GAA AAA ACA GAA CGA GAG AAT CGG CTT AAA GAG TTT TAC 823
His Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe Tyr

ACC AGG GAG TAT GAA AAG CTT CGG GAC ACT TAC ATT GAA GAA GCA GAG 871
Thr Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala Glu

AAG TAC AAA ATG CAA TTG CAA GAG CAG TTT GAC AAC TTA AAT GCG CAT 919
Lys Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala His

GAA ACC TCT AAG TTG GAA ATT GAA GCT AGC CAC TCA GAG AAA CTT GAA 967
Glu Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu Glu

TTG CTA AAG AAG GCC TAT GAA GCC TCC CTT TCA GAA ATT AAG AAA GGC 1015
Leu Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys Gly

CAT GAA ATA GAA AAG AAA TCG CTT GAA GAT TTA CTT TCT GAG AAG CAG 1063
His Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys Gln

GAA TCG CTA GAG AAG CAA ATC AAT GAT CTG AAG AGT GAA AAT GAT GCT 1111
Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp Ala

TTA AAT GAA AAA TTG AAA TCA GAA GAA CAA AAA AGA AGA GCA AGA GAA 1159
Leu Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg Glu

AAA GCA AAT TTG AAA AAT CCT CAG ATC ATG TAT CTA GAA CAG GAG TTA 1207
Lys Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu Leu

GAA AGC CTG AAA GCT GTG TTA GAG ATC AAG AAT GAG AAA CTG CAT CAA 1255
Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His Gln

CAG GAC ATC AAG TTA ATG AAA ATG GAG AAA CTG GTG GAC AAC AAC ACA 1303
Gln Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn Thr

GCA TTG GTT GAC AAA TTG AAG CGT TTC CAG CAG GAG AAT GAA GAA TTG 1351
Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu Leu

AAA GCT CGG ATG GAC AAG CAC ATG GCA ATC TCA AGG CAG CTT TCC ACG 1399
Lys Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser Thr

GAG CAG GCT GTT CTG CAA GAG TCG CTG GAG AAG GAG TCG AAA GTC AAC 1447
Glu Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val Asn

AAG CGA CTC TCT ATG GAA AAC GAG GAG CTT CTG TGG AAA CTG CAC AAT 1495
Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His Asn

GGG GAC CTG TGT AGC CCC AAG AGA TCC CCC ACA TCC TCC GCC ATC CCT 1543
Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile Pro

TTG CAG TCA CCA AGG AAT TCG GGC TCC TTC CCT AGC CCC AGC ATT TCA 1591
Leu Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile Ser

CCC AGA TGA CACGTCCCCA AAGTCCACAG ACTCTCTGAA AGCATTTTGA 1640
Pro Arg *

TGCAGGTCTG CAGGACTGAC CCCAAGGAGG AACGTGGGCA CAAGAGGTAT ATCAGCACAC 1700 

GTGTGATCAC CGTAGGTAAC TGGAGCGTCA CCACCGGCGG AATCGAGCTT CTGAGACTGG 1760

AAGTCTGGAG GAAGACTTTT GCCTCCGTCC AAAAGATTCC TCCAAAAAAA GATTTAAAAA 1820

AAGATTTCGG CATCGACACG GACGTTGTTG CACAAAGCAC TTAAAGAACG AGAGCATCTT 1880

GTTCATTGCC TTTTTCACCT AAGCATAAGG GGAAAAACTC TCAGGGCCCT ATTAAGATTT 1940

ATAACCTTTG TAATGTTCTT CACCACAGAC ACCTTCTTGT GAGTTTTCAG TCTGACTGTG 2000

GGGGTGGGGG GTGTGAATGA AATGGATGTC ACAGAGTGTC ATGTGTCTGA TGCAGCCTCC 2060

TCTGCTGTGT ATTAAATGTC AAAATCTGAA TATATCTGGA TATGTACTAA TCAAATAATA 2120

ATCAATCAAT CAGCATATAC ATTTCAGCCA AAGCCATAGA AGAAAAAGCA ATAGTTGCTT 2180

GAATTATGAT CATCTACCAC CAACTCTGCT CAGCCCTGTA ACAGGGTAGG GAGAGGGTAT 2240

AACAGGAAGA GCTTTGACTT GTCCCTGTCT ATACATTCTC TGTATCTTTT GGGGGTAACT 2300

TCTTGGCAGT TTTTCAGTGT TCAGCCATGT CAGTTGAAAC TAGATTTTTC TGTAGATTTT 2360

TTACTTACCC ATGTGAGCCT AACACTATCC TGTAATTCAT TTTCTCAGGC TATGTGTAAA 2420

TGTAGAACCC TAATTTTTCT ATAAAAAAAC AAACTAACTA ACTGTGTAAA GAAAGAAAAA 2480

GGGAAGTACC AATGGGTTTT TCCACCTTAT TTTTACCTTT GATCTACCCT TGCAGATTTA 2540

ACCTGTCTTC TTCCCTCCCA TTATTCTCAT TTTCCTTTTA CCTTTCTCCA CCATCCAGAG 2600

CCACAAAAGC AAACCTTCTA CCTCCTACCT ACTTTTCTCT GGGACAAGGA TAAAGGAATA 2660

TGATTTTCCA GAGCCCCAGA GCCAGCTCAT CTTCCAGGTG CTGAAACCAC TTTCCAAATA 2720

AACTAAAGCC TGGATTTGAT ATTACAAATT TTGGGAAATC TTAGAATAAA GAACGAGAAC 2780

AAGGAAGTCA TTGGCTAGTA TAATTAAGAA AGGTAGGATT CAGTGCTTAC CGATGATGCA 2840

GTACTTGATA GAAGAAAACA GTCTGGGAGG ATAGCGCTCA TTTTTCAGTT ACCCTTTAAG 2900

GAGTCCCTTT GTCTTTGGGA AAGTAGCAGA ATGGTCCGCT TCTTTCCCAT GAGTGGAAAA 2960

TGTGGCTTGT CCAACTCTCC TCCAGGTTGC ATTTCAGTTT CTTTCCAAAA CTTATTACCT 3020

CCCCTAATCC TGAGACTTTG GAAAAGGTGG AAGGAAGAAC TGTTGCTTTA TCTCCCCCTC 3080

CCTGCATGTG TCAACATTGT GATGTCAGTA TTTACTAATC TACATTCAGT GGCTGTACAA 3140

ATAACAGCTG TAGTAAGAAG AGATTCAGGA TGCTAGAGGT GAATATTTGG GTCATTTACA 3200

TGTACACTAC ATAGCAAGTT GATACTCATG TTGCATGTTC TTTTAAATTA GTGATTTTGT 3260

GTCTTAAGTC TTTAACTTCC AATACTTCAT CATGTATGTA ACCTTCCATG TTTGCTTCTG 3320

ATAAATGGAA ATGTAGGTTC ACTGCCACTT CATGAGATAT CTCTGCTCAC GCTTCCAAGT 3380

TGTTCTCAAT GACATTAGCC AAAGTTGGGT TTGCCATTCA TCCCCTAGGC ATGGTAAATC 3440

TTGTGTTGTT CCCTGCTGTC CTCCGTATTA CGTGACCGGC AAATAAATCT CATAGCAGTT 3500

AATATAAAAC ATCTTTGGAG GATGGGAGAG AACAGGAGGG AAGATGGGAA ACAAAATAGA 3560

GAATTCTTAA GATTTTGTTT AAACCAAATG TTTCATGTAG AATGCAAAAT GTTGGCACGT 3620

CAAAAATATG AATGTGTAGA CAACTGTAGT TGTGCTCAGT TTGTAGTGAT GGGAAGTGTA 3680

TTTTACTCTG ATCAAATAAA TAATGCTGGA ATACTCAAAA AAAAAAAAAA AAAAAAAAAA 3740
AA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 435 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Leu Ser Pro Lys Phe Ser Leu Ser Thr Ile His Ile Arg Leu
  1               5                  10                  15

Thr Ala Lys Gly Leu Leu Arg Asn Leu Arg Leu Pro Ser Gly Phe Arg
             20                  25                  30

Arg Ser Thr Val Val Phe His Thr Val Glu Lys Ser Arg Gln Lys Asn
         35                  40                  45

Pro Arg Ser Leu Cys Ile Gln Pro Gln Thr Ala Pro Asp Ala Leu Pro
     50                  55                  60

Pro Glu Lys Thr Leu Glu Leu Thr Gln Tyr Lys Thr Lys Cys Glu Asn
 65                  70                  75                  80

Gln Ser Gly Phe Ile Leu Gln Leu Lys Gln Leu Leu Ala Cys Gly Asn
                 85                  90                  95

Thr Lys Phe Glu Ala Leu Thr Val Val Ile Gln His Leu Leu Ser Glu
            100                 105                 110

Arg Glu Glu Ala Leu Lys Gln His Lys Thr Leu Ser Gln Glu Leu Val
        115                 120                 125

Asn Leu Arg Gly Glu Leu Val Thr Ala Ser Thr Thr Cys Glu Lys Leu
    130                 135                 140

Glu Lys Ala Arg Asn Glu Leu Gln Thr Val Tyr Glu Ala Phe Val Gln
145                 150                 155                 160

Gln His Gln Ala Glu Lys Thr Glu Arg Glu Asn Arg Leu Lys Glu Phe
                165                 170                 175

Tyr Thr Arg Glu Tyr Glu Lys Leu Arg Asp Thr Tyr Ile Glu Glu Ala
            180                 185                 190

Glu Lys Tyr Lys Met Gln Leu Gln Glu Gln Phe Asp Asn Leu Asn Ala
        195                 200                 205

His Glu Thr Ser Lys Leu Glu Ile Glu Ala Ser His Ser Glu Lys Leu
    210                 215                 220

Glu Leu Leu Lys Lys Ala Tyr Glu Ala Ser Leu Ser Glu Ile Lys Lys
225                 230                 235                 240

Gly His Glu Ile Glu Lys Lys Ser Leu Glu Asp Leu Leu Ser Glu Lys
                245                 250                 255

Gln Glu Ser Leu Glu Lys Gln Ile Asn Asp Leu Lys Ser Glu Asn Asp
            260                 265                 270

Ala Leu Asn Glu Lys Leu Lys Ser Glu Glu Gln Lys Arg Arg Ala Arg
        275                 280                 285

Glu Lys Ala Asn Leu Lys Asn Pro Gln Ile Met Tyr Leu Glu Gln Glu
    290                 295                 300

Leu Glu Ser Leu Lys Ala Val Leu Glu Ile Lys Asn Glu Lys Leu His
305                 310                 315                 320

Gln Gln Asp Ile Lys Leu Met Lys Met Glu Lys Leu Val Asp Asn Asn
                325                 330                 335

Thr Ala Leu Val Asp Lys Leu Lys Arg Phe Gln Gln Glu Asn Glu Glu
            340                 345                 350

Leu Lys Ala Arg Met Asp Lys His Met Ala Ile Ser Arg Gln Leu Ser
        355                 360                 365

Thr Glu Gln Ala Val Leu Gln Glu Ser Leu Glu Lys Glu Ser Lys Val
    370                 375                 380

Asn Lys Arg Leu Ser Met Glu Asn Glu Glu Leu Leu Trp Lys Leu His
385                 390                 395                 400

Asn Gly Asp Leu Cys Ser Pro Lys Arg Ser Pro Thr Ser Ser Ala Ile
                405                 410                 415

Pro Leu Gln Ser Pro Arg Asn Ser Gly Ser Phe Pro Ser Pro Ser Ile
            420                 425                 430

Ser Pro Arg *
        435



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGTTGTTGT CTCCCAAATT CTCCTTATCC ACCATTCACA TACGACTGAC GGCCAAAGGA 60 

TTGCTTCGAA ACCTTCGACT TCCTTCAGGG TTTAGGAGAA GCACTGTTGT TTTCCACACA 120 

GTTGAAAAGA GCAGGCAAAA GAATCCTCGA AGCTTATGTA TCCAGCCACA GACAGCTCCC 180 

GATGCGCTGC CCCCTGAGAA AACACTTGAA TTGACGCAAT ATAAAACAAA ATGTGAAAAC 240

CAAAGTGGAT TTATCCTGCA GCTCAAGCAG CTTCTTGCCT GTGGTAATAC CAAGTTTGAG 300 

GCATTGACAG TTGTGATTCA GCACCTGCTG TCTGAGCGGG AGGAAGCACT GAAACAACAC 360

AAAACCCTAT CTCAAGAACT TGTTAACCTC CGGGGAGAGC TAGTCACTGC TTCAACCACC 420 

TGTGAGAAAT TAGAAAAAGC CAGGAATGAG TTACAAACAG TGTATGAAGC ATTCGTCCAG 480 

CAGCACCAGG CTGAAAAAAC AGAACGAGAG AATCGGCTTA AAGAGTTTTA CACCAGGGAG 540

TATGAAAAGC TTCGGGACAC TTACATTGAA GAAGCAGAGA AGTACAAAAT GCAATTGCAA 600 

GAGCAGTTTG ACAACTTAAA TGCGCATGAA ACCTCTAAGT TGGAAATTGA AGCTAGCCAC 660 

TCAGAGAAAC TTGAATTGCT AAAGAAGGCC TATGAAGCCT CCCTTTCAGA AATTAAGAAA 720 

GGCCATGAAA TAGAAAAGAA ATCGCTTGAA GATTTACTTT CTGAGAAGCA GGAATCGCTA 780 

GAGAAGCAAA TCAATGATCT GAAGAGTGAA AATGATGCTT TAAATGAAAA ATTGAAATCA 840 

GAAGAACAAA AAAGAAGAGC AAGAGAAAAA GCAAATTTGA AAAATCCTCA GATCATGTAT 900

CTAGAACAGG AGTTAGAAAG CCTGAAAGCT GTGTTAGAGA TCAAGAATGA GAAACTGCAT 960 

CAACAGGACA TCAAGTTAAT GAAAATGGAG AAACTGGTGG ACAACAACAC AGCATTGGTT 1020 

GACAAATTGA AGCGTTTCCA GCAGGAGAAT GAAGAATTGA AAGCTCGGAT GGACAAGCAC 1080

ATGGCAATCT CAAGGCAGCT TTCCACGGAG CAGGCTGTTC TGCAAGAGTC GCTGGAGAAG 1140

GAGTCGAAAG TCAACAAGCG ACTCTCTATG GAAAACGAGG AGCTTCTGTG GAAACTGCAC 1200 

AATGGGGACC TGTGTAGCCC CAAGAGATCC CCCACATCCT CCGCCATCCC TTTGCAGTCA 1260

CCAAGGAATT CGGGCTCCTT CCCTAGCCCC AGCATTTCAC CCAGATGA 1308
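
The figures stated in the listing can be cross-checked with a short script. The sketch below is illustrative only and is not part of the original filing: it assumes the standard genetic code, and the helper names `cds_length` and `translate` are hypothetical. It checks that the CDS location 293..1600 of SEQ ID NO: 7 spans 1308 bases (the length of SEQ ID NO: 9), that this corresponds to 435 residues plus a stop codon (the length of SEQ ID NO: 8), and that the first and last codons translate as shown in the listing.

```python
# Illustrative consistency check; assumes the standard genetic code.
# Helper names below are hypothetical and do not come from the listing.

BASES = "TCAG"
# One-letter amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def cds_length(start: int, end: int) -> int:
    """Number of bases in a CDS given 1-based, inclusive coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate DNA (spaces ignored) into one-letter residues; '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Coordinates taken from the FEATURE block of SEQ ID NO: 7.
length = cds_length(293, 1600)
codons = length // 3
residues = codons - 1  # the final codon is the stop codon

# First 30 and last 9 bases of SEQ ID NO: 9, copied from the listing.
first_residues = translate("ATGTTGTTGT CTCCCAAATT CTCCTTATCC")
last_residues = translate("CCCAGATGA")

print(length, codons, residues)
print(first_residues, last_residues)
```

The one-letter string for the first ten codons corresponds to Met Leu Leu Ser Pro Lys Phe Ser Leu Ser, and the last three codons to Pro Arg followed by the stop.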

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAAGCGTTCT CTCGGAGGAC A 21 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGCGGATCCC AGACAGACCG GACGGAACTG GAG 33 
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCGGAATTCA CTACAACCTT TCGTTTAAAG CATC 34



